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PREFACE 


Two successive eras of investigations of the foundations of mathematics 
in the nineteenth century, culminating in the theory of sets and the 
arithmetization of analysis, led around 1900 to a new crisis, and a new 
era dominated by the programs of Russell and Whitehead, Hilbert and of 
Brouwer. 

The appearance in 1931 of Godel’s two incompleteness theorems, 
in 1933 of Tarski’s work on the concept of truth in formalized languages, 
in 1934 of the Herbrand-Godel notion of ‘general recursive function’, 
and in 1936 of Church’s thesis concerning it, inaugurate a still newer 
era in which mathematical tools are being applied both to evaluating the 
earlier programs and in unforeseen directions. 

The aim of this book is to provide a connected introduction to the 
subjects of mathematical logic and recursive functions in particular, and 
to the newer foundational investigations in general. 

Some selection was necessary. The main choice has been to concentrate 
after Part I on the metamathematical investigation of elementary number 
theory with the requisite mathematical logic, leaving aside the higher 
predicate calculi, analysis, type theory and set theory. This choice was 
made because in number theory one finds the first and simplest exempli¬ 
fication of the newer methods and concepts, although the extension to 
other branches of mathematics is well under way and promises to be 
increasingly important in the immediate future. 

The book is written to be usable as a text book by first year graduate 
students in mathematics (and above) and others at that level of mathe¬ 
matical facility, irrespective of their knowledge of any particular mathe¬ 
matical subject matter. 

In using the book as a text book, it is intended that Part I (Chapters 
I — III), which provides the necessary background, should be covered 
rapidly (in two or three weeks by a class meeting three times a week). The 
intensive study should begin with Part II (Chapter IV), where it is es¬ 
sential that the student concentrate upon acquiring a firm grasp of 
metamathematical method. 

The starred sections can be omitted on a first reading or examined in 
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a cursory manner. Sometimes it will then be necessary later to go back and 
study an earlier starred section (e.g. § 37 will have to be studied for § 72). 

Godel's two famous incompleteness theorems are reached in Chapter 
VIII, leaving a lemma to be proved in Chapter X. The author has found 
it feasible to complete these ten chapters (and sometimes a bit more) 
in the semester course which he has given along these lines at the Uni¬ 
versity of Wisconsin. 

The remaining five chapters can be used to extend such a course to a 
year course, or as collateral reading to accompany a seminar. 

A semester course on recursive functions for students having some prior 
acquaintance with mathematical logic, or under an instructor with such 
acquaintance, could start with Part III (Chapter IX). There are other 
possibilities for selecting material; e.g. much of Part IV can follow directly 
Part II or even Chapter VII for students primarily interested in mathe¬ 
matical logic. 

The author is indebted to Saunders MacLane for encouraging him to 
write this book and for valuable criticism of an early draft of several 
chapters. John Addison read the entire first printer’s proof with great care, 
independently of the author. Among many others who have been of as¬ 
sistance are Evert Beth, Robert Breusch, Arend Heyting, Nancy Kleene, 
Leonard Linski, David Nelson, James Renno and Gene Rose. Scientific 
indebtedness is acknowledged by references to the Bibliography; especially 
extensive use has been made of Hilbert and Bemays’ "Grundlagen der 
Mathematik” in two volumes 1934 and 1939 . 

July 1952 S. C. Kleene 


Note to the Sixth Reprint (1971). In successive reprints various errors have been 
corrected, the principal corrections being those listed in Jour, symbolic logic vol. 19 
(1954) P- 216 and voi - 33 (1968) pp. 290-291, and: on p. 505 bottom paragraph 
sg((r) 0 ) 'P(( r )i) + Mo*?(Mi) replaced by a function x{P> <?> r ) defined by Theorem 
XX (c); on p. 506 allowance made in the middle paragraph for x possibly occurring 
free in t, and line 5 from below " = ” changed to Moreover, in this sixth 

reprint eleven bibliographical references have been updated (cf. end p. 517) and 
two shoYt notes have been added (on pp. 65 and 316). 
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Chapter I 


THE THEORY OF SETS 

§ 1. Enumerable sets. Before turning to our main subject, it will 
be appropriate to notice briefly Cantor’s theory of sets. 

A flock of four sheep and a grove of four trees are related to each other 
in a way in which neither is related to a pile of three stones or a grove of 
seven trees. Although the words for numbers have been used to state this 
truism on the printed page, the relationship to which we refer underlies 
the concept of cardinal number. Without counting the sheep or the trees, 
one can pair them with each other, for example by tethering the sheep 
to the trees, so that each sheep and each tree belongs to exactly one of the 
pairs. Such a pairing between the members of two collections or 'sets’ 
of objects is called a one-to-one (1-1) correspondence. 

In 1638 Galileo remarked that the squares of the positive integers can 
be placed in a 1-1 correspondence with the positive integers them¬ 
selves, thus 

1, 4, 9, 16, ..., n\ ... 

1, 2, 3, 4 

despite the ancient axiom that the whole is greater than any of its parts. 
Cantor, between 1874 and 1897, first undertook systematically to compare 
infinite sets in terms of the possibility of establishing 1-1 correspond¬ 
ences. 

The two sets in Galileo’s “paradox” and the set of the natural numbers 

0, 1, 2, 3, ..., n — 1 , ..., 

are examples of infinite sets which are ‘enumerable’. Choosing the last 
named as the standard, we define an infinite set to be enntnerable (or 
denumerable or countable), if it can be placed in a 1-1 correspondence 
with the natural numbers. 

To show that an infinite set is enumerable, we need merely indicate 
how its members can be given (without repetitions) in an ‘infinite list’; 
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then the first in the list corresponds to 0, the second to 1, and so on. 
Although the list itself is infinite, each member occupies a finite position 
in the list. 

A particular infinite list (without repetitions) of the members of the 
set, or 1-1 correspondence between the set and the natural numbers, 
is called an enumeration of the set; the number corresponding to a given 
member is the index of the member in the enumeration. 

The members of a finite set can also be given in a list, i.e. a finite list. 
Hence the term enumerable is sometimes applied to sets which are either 
infinite and enumerable, i.e. enumerably infinite, or else finite. 

The set of the integers can be enumerated, by listing them in the 
following order, 

0, 1, -1, 2, -2, 3, -3,.... 

The set of the rational numbers is also enumerable, a fact which is 
surprising if one first compares them with the integers in the usual 
algebraic order. The points on the x-axis with integral abscissas are isolat¬ 
ed, while those with rational abscissas are ‘everywhere dense’, i.e. between 
each two no matter how close there are others. The enumeration can be 
accomplished by a device which we shall present for the positive rational 
numbers, leaving the case of all the rationals to the reader. Let the 
fractions of positive integers be arranged in an infinite matrix, thus, 

Vi V.- 1 /. Vi - ••• 

4 / * / 

2 /l 2 / 2 2 /s 2 / 4 ... 

* / 

3 /i 3 / 2 7 3 3 /4 ... 

4 

4 /l 7 2 Vs V 4 ••• 


Then let these fractions be enumerated by following the arrows. A rational 
number is one which can be expressed as a fraction of integers. Go 
through the' enumeration of fractions striking out each one which is 
equal in value to one that has preceded it. This leaves the following 
enumeration of the positive rational numbers, 

1, 2, V 2 , Vs. 3, 4, */ 2 . 2 /s. 7 4 . . 
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The device of the matrix constitutes a general one for enumerating the 
ordered pairs of members of an enumerable set, e.g. the ordered pairs of 
natural numbers, or the ordered pairs of integers. The rows of the matrix 
are the enumerations of the pairs with the first member of the pair fixed. 
The ordered triples of members of an enumerable set can then be enumer¬ 
ated by another application of the matrix, taking as the rows the 
enumerations already won of the triples with the first member of the 
triple fixed. Successively we can win enumerations of the ordered 
n-tuples of members of an enumerable set for each fixed positive integer 
n. All of these enumerations, including the enumeration of the original 
set, can be taken as the rows of a new matrix to obtain an enum¬ 
eration of the ordered ^-tuples for variable n, i.e. the finite sequences of 
members of an enumerable set. 

This result can be applied to obtain an enumeration of the algebraic 
equations 

a 0 x n -f apt"- 1 + ... + + a„ = 0 (a 0 ^ 0) 

with integral coefficients, since each equation can be described by giving 
the sequence 

(<*o> ®i» • • • > &n— 1> ^n) 

of its coefficients. A ‘(real) algebraic number’ is a real root of an equation 
of this sort. Since a given equation has at most n different roots, the 
algebraic numbers are enumerable. 

Another device will illustrate the possibilities for enumerating sets. 
In dealing with an enumerable set (finite or infinite), the numbers which 
correspond to the members in some specified enumeration can be used 
to designate or name the members individually. Now conversely, if a 
name or explicit expression can be assigned to every one of the members 
of a set individually, in a preassigned and unambiguous system of nota¬ 
tion, the set is enumerable (finite or infinite). We stipulate that a name or 
expression shall be a finite sequence of symbols chosen from a given 
finite alphabet of available symbols. For example, the algebraic equations 
with integral coefficients can be written using decimal notation for the 
coefficients and exponents. The raised exponents are an inessential feature 
of the notation, which can be removed by a suitable convention. Indeed, 
so long as we are dealing only with these equations, we may simply write 
the exponents on the line. The symbols required are then precisely 

0, 1, 2, 3, 4, 5, 6 , 7, 8, 9, x, +, -, =. 
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The first symbol in an equation is not a 0. Reinterpret these symbols as 
the digits (!) in a quattuordecimal number system, i.e. a number system 
based on 14 in the same way that the decimal system is based on 10. 
Every equation becomes a natural number (distinct equations becoming 
distinct numbers). Enumerate the equations in the order of magnitude of 
these numbers. 

§ 2. Cantor’s diagonal method. That there are infinite sets con¬ 
sidered in mathematics which cannot be enumerated was shown by 
Cantor’s famous ‘diagonal method’. The set of the real numbers is 
non-enumerable. 

Let us first consider the real numbers x in the interval 0 < x <, 1. 
Each real number in this interval is represented uniquely by a proper 
non-terminating decimal fraction, i.e. a decimal fraction having its first 
significant digit to the right of the decimal point, and having infinitely 
many digits that are not 0. A number may have a terminating decimal 
fraction, i.e. one with repeating 0’s, but that fraction is replaceable by a 
non-terminating fraction with repeating 9’s. For example .483 or 
.483000... can be replaced by .482999-Conversely every proper non¬ 

terminating decimal fraction represents a unique real number in the 
interval. 

Now suppose that 

*0> *1> *2> *3» • • • 

is an infinite list or enumeration of some but not necessarily all of the 
real numbers belonging to the interval. Write down one below another 
their respective non-terminating decimal fractions, 


•*oo 

*01 

\ 

*02 

*03 

*10 

*11 

*12 

\ 

*13 

•*20 

*21 

*22 

\ 

*23 

•*30 

*31 

*32 

*33 


Select the diagonal fraction shown by the arrows. In this change 
each of the successive digits x nn to a different digit x' nn , but avoid 
producing a terminating fraction. Say, let x’ nn = 5 if x nn ^ 5, and 

*n» = 6 if *»» = 5 - 
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The resulting fraction 

t t t t 

•^00 ^11 X 22 '*'33 • • • 

represents a real number x which belongs to the interval but not to the 
enumeration. For the fraction differs from the first of the given fractions 
in the tenths place, from the second in the hundredths place, from the 
third in the thousandths place, and so on. 

Hence the given enumeration is not an enumeration of all the real 
numbers in the interval. An enumeration of all the real numbers in the 
interval is non-existent. 

To apply the diagonal method to the real numbers without restriction 
to the interval 0 < x <; 1 , it is only necessary to represent the real numbers 
in the characteristic-plus-mantissa form, e.g. 37.142... = 37 + .142, 
— 2.813... = — 3 + .186..., and to apply the method to the man¬ 
tissas. 

It is clear that an essential difference has been revealed between 
the set of the rational numbers or the set of the algebraic numbers on the 
one hand, and the set of the real numbers on the other. 

It is interesting historically to note how Cantor’s discoveries in 1874 
(see the bibliography) illuminated an earlier discovery of Liouville in 1844. 
Liouville had been able to construct by a special method certain tran¬ 
scendental (i.e. non-algebraic) real numbers. Cantor’s diagonal method 
makes the existence of transcendental numbers apparent from only the 
very general considerations presented above. In fact, to any given 
enumeration x 0 , x v x 2 , x 3 , ... of the algebraic numbers, particular 
transcendentals can be obtained by the diagonal method. 

The (real) transcendental numbers are not enumerable. For if they 
were, like the algebraic numbers, enumerations of the two sets could 
be combined to produce an enumeration of all the real numbers. Thus, 
in a sense, most real numbers are transcendental. 

Another example of a non-enumerable set is the set of the (single¬ 
valued) functions for which the independent and dependent variables 
each range on an enumerable set. For definiteness, consider the set of the 
functions of a natural number taking a natural number as value (or infinite 
sequences of natural numbers ). Suppose an enumeration is given of some 
but not necessarily all of them, 

/o(»). /i(»). /«(»). /»(»). 

Write the sequences of the values of the successive functions one below 
another, as the rows of an infinite matrix. 
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m 

/•(i) 

/o(2) 

/o(3) 

\ 



m 

/i0) 

/i(2) 

/i(3) 



\ 


m 

/.(o 

/•( 2) 

/«(3) 



\ 

/s(0) 

/.(o 

/.(2) 

m 


\ 

Take the sequence of values given by the diagonal. Change every one of 
these values to a different value, say by adding 1. The function /(») with 
the resulting sequence of values, which we may write 

/(«) = /„(») + 1 , 

cannot belong to the enumeration, since it differs from the first of the 
enumerated functions in the value taken for 0, from the second in the 
value taken for 1, and so on. 

To phrase the argument differently, suppose that the function f(n) 
were in the enumeration; i.e. suppose that for some natural number q, 

m = /,(«) 

for every natural number n. Substituting the number q for the variable 
n in this and the preceding equation, 

m = tM= fM + 1 . 

This is impossible, since the natural number f Q {q) cannot equal itself in¬ 
creased by 1. 

Still another example of a non-enumerable set is the set of the sets 
of natural numbers. (But the set of the finite sets of natural numbers is 
enumerable. Why?) We can represent a set of natural numbers by a 
representing function, which takes the value 0 for a natural number 
belonging to the set and the value 1 for a natural number not belonging 
to the set. The sequence of the values of the representing function of a 
set of natural numbers is an infinite sequence of 0’s and 1 's. For example, 
the sequence for a set containing 0, 2 and 3 but not 1 and 4 starts out 
0 10 0 1 .... These sequences are taken as the rows of the infinite matrix. 
The alteration performed on the diagonal is the interchange of 0’s and 1 ’s. 

Can these, several non-enumerable sets be placed in 1-1 corres¬ 
pondence with one another, and are there still other types of infinite 
sets ? The reader may profit by attempting to answer these questions 
himself (answers are given in § 5). We shall now look at Cantor’s theory in 
its general formulation. 
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§ 3. Cardinal number. Cantor’s theory of ‘abstract sets’ deals with 
sets in general. (He gave also a theory of 'point sets’.) Cantor describes 
his terms set and element as follows. “By a ‘set’ we understand any col¬ 
lection M of definite well-distinguished objects m of our perception or our 
thought (which are called the ‘elements’ of M) into a whole.” (1895 p. 481.) 

The sets include the empty or null or vacuous or void set which has no 
elements, and the unit sets which have a single element each. We write 
the empty set as O; the unit set with sole element a as {a}', and the set 
having a, b, c, ... as its elements as {a, b, c, . ..}. 

A set may also be called an aggregate or collection or class or domain or 
totality. That a is an element of M may also be expressed by saying that a 
is a member of M or belongs to M or is in M, or in symbols, a 8 M. If a is 
not an element of M, in symbols, atM. 

We understand two sets M and N to be the same (in symbols, M — N), 
if they have the same elements; i.e. if for every object a, a 8 M if and only 
if a £ N. 

Two sets M and N are said to be equivalent (in symbols, M ~ TV), 
if there exists a 1-1 correspondence (§ 1 ) between them. (Sometimes 
we say “the correspondence M ~ N” to refer to a particular 1-1 
correspondence between M and N, which must exist if M ~ N.) 

The relation M ~ N evidently possesses the ‘reflexive’, ‘symmetric’ 
and ‘transitive’ properties, i.e. for any sets M, N and P: M ~ M. If 
M ~ N, then N ~ M. If M ~ N and N ~ P, then M ~ P. 

The cardinal number of a set M is introduced as an object M which 
is associated in common with all and only those sets (including M itself) 
which are equivalent to M. By this definition: M = N, if and only if 
M~N. 

What cardinal numbers are, further than this, is perhaps immaterial; 
but we may notice several interpretations. Cantor describes them thus: 
“The general concept which with the aid of our active intelligence results 
from a set M , when we abstract from the nature of its various elements 
and from the order of their being given, we call the ‘power’ or 'cardinal 
number’ of M.” This double abstraction suggests his notation “M” for 
the cardinal of M. Frege 1884 and Russell 1902 identify the cardinal 
number M with the set of the sets equivalent to M ; while von Neumann 
1928 chooses from each of these sets of sets (‘equivalence classes’) a par¬ 
ticular set to serve as the cardinal of any set in the class. 

The notion of a ‘part’ of a collection is introduced by the following 
definition. A set M x is a subset of a set M (in symbols, M x c M), if each 
element of M x is an element of M. 
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Example 1 . The set {a, b, c) of three elements a, b, c has eight (= 2 3 ) 
subsets: O, {a}, { 6 }, {c}, {a, b), {a, c}, {b, c}, {a, b, c). 

Note that the subsets of a set M include the vacuous set O, and the 
set M itself. The latter is the improper subset, and the other subsets are 
■proper. Evidently, if M 2 C M 1 and M x C M (abbreviated M 2 c M x C M), 
then M 2 C M. 

The union or sum M -j- N of two sets M and N is the set of the objects 
belonging to at least one of M and N (i.e. belonging to M or to N); and 
their intersection or product M • N is the set of the objects belonging to 
both of M and N (i.e. belonging to M and to N). Similarly for more than 
two sets. The difference M — N of M and N (when N c M, also called 
the complement of N with respect to M) is the set of the objects belonging 
to M but not to N. 

Example 2. {a, b, c } + {b, d} = {a, b, c, d}, {a, b, c} • { b, d} = {&}, 
{a, b, c } — {b, d } = {a, b, c } — {6} = {a, c }. 

Evidently M — M x c M ; and if M x c M, then (and only then) 
M x (M — M x ) = M. Two sets M and N are disjoint, if they have no 
common elements, i.e. if M • N = 0. For example, M x and M — M x are 
disjoint sets. If M and N are disjoint, either M ^ N or M = N — O. 

We turn to the important question of comparing cardinal numbers. 
Given two sets M and N, it may or may not be possible to put M into 
1-1 correspondence with some subset N x of N. Vice versa, there may 
or may not exist a subset M x of M which is equivalent to N. Combining 
these two pairs of alternatives gives four cases, exactly one of which must 
apply to any given pair of sets M and N: 


(la) For some N x , 

M ~N x cN; 

but for no 

M x , 

N ~M X CM. 

(lb) For no N x , 

M ~N x cN; 

but for some 

M x , 

N ~M x c.M. 

( 2 ) For some N x , 

M ~N x cN; 

and for some 

M x , 

N ~M X CM. 

(3) For no N x , 

M ~N x <zN; 

and for no 

M x , 

N ~M X CM. 


In Case (la), the cardinal of M is said to be less than the cardinal of N 
(in symbols, M <N). To justify considering < as a relation between the 
cardinals M and N, and not merely one between the sets M and N, we 
must observe that if M' ~ M and N' ~ N, then Case (1 a) applies to the 
pair of sets M', N' if and only if it applies to the pair M, N. 

The order relation for cardinals is transitive, i.e. for any three cardinals 
M, N, f: If < Kf and ft <_P, then M < P. = 

We define M to be > ft, if ft < M. Then M > N exactly in Case (lb). 
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The relationship M — N, i.e. M ~ N, evidently falls under Case ( 2 ) by 
taking as N l and M 1 the improper subsets. Hence, for any two cardinals 
M and N, the three relationships M < N, M = N and M > N are ‘mu¬ 
tually exclusive’, i.e. not more than one of them can hold. 

It does not appear till an advanced stage of the theory (references in 
§ 5) whether they are ‘exhaustive’, i.e. whether at least one of the three 
must hold. The situation is partially clarified by the next theorem, 
after which the question remains only whether Case ( 3 ) can arise. 


*§ 4. The equivalence theorem, finite and infinite sets. Theo¬ 
rem A. If M ~ N x C N and N ~ M x c M, then M ~N. In other words: 
In Case (2) of § 3 , M = (F. Bernstein 1898.) 


Proof. By the hypotheses, we may suppose given a particular 1-1 
correspondence M ~ N 1 between M and the subset N 1 oi N; and similarly 
N ~ M v Our problem is to find a third 1-1 correspondence M ~ N. 

Let A 0 = M — M 1 . In the given correspondence M rL N v the elements 
of the subset A 0 of M will correspond to elements forming a subset B 1 of N x 
(and hence of N), or in symbols A 0 ~L B v Then in the other given corres¬ 
pondence N ~ M v the elements of the subset B t of N will correspond to 
elements forming a subset A 1 of M 1 (and hence of M ), or in symbols 
JBi ~ ^4 1 ; and so on. Thus 

A 0 '-L Bi A 1 ri^ B% ~ A 2 B s Ag .... 

The situation may be grasped by picturing M and N as mirrors by which 
the part A 0 of M outside M x is reflected back and forth to produce an 
infinite succession of images A v A 2 , A 3 , ... in M and B v B 2l B z , ... in 
N, as shown in the figure. (The sets M, M x and N are represented by 



the parts of the horizontal lines to the right of the labels “M”, “M x ’ 
and “N” ; the sets A 0 , B v A v ... by the intercepted segments.) 

Let A = A 0 + A x + Ag + A z +...; i.e. A is the subset of M con¬ 
taining the elements which fall in A 0 or in any of its images A v A 2 , 
A z , ... in M. Also let B = B x + B z + B z + ...; i.e. B is the subset 
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of N containing the elements which fall in any of the images B 1 , B 2 , 
B 3 , ... of A 0 in N. 

To obtain the 1-1 correspondence MrtN, we state a rule which 
determines to each element m of M a corresponding element n of N, 
and prove that the resulting correspondence is 1-1 between M and N. 

Rule. Consider any element m of M. Either m belongs to the subset 
A, or m does not belong to A, i.e. m belongs to M — A . If m belongs to A, 
the corresponding element n of N shall be that which corresponds to m in 
the correspondence M ~ N v If m belongs to M — A (in which case m 
belongs to M x ), the corresponding element n of N shall be that to which m 
corresponds in the correspondence N ^ M v 

The resulting correspondence is 1-1 between M and N, for: 

(a) To different elements m of M, say m x and m 2 , there correspond 
different elements n x and n 2 of N. This is clear when m x and m 2 both 
belong to A or both to M — A. But it is also so when nt 1 £ A and 
m 2 £ M — A, since then n x £ B and n 2 £ N — B. 

(b) Each element of N corresponds to some element m of M. Namely, 
the elements of B all correspond to elements in A, and the elements of 
N — B all correspond to elements in M — A. 

The method of bringing M and N into 1-1 correspondence may be 
visualized as a shifting in the above picture of each of the parts A 0 , 
A v A 2 , A 3 , ... of M one position to the right, so that A 0 takes the place 
of A v A x of A 2 , A 2 of -4 S ,_This changes N M x into N ,-L M. 

Corollary A. If M cN, then M <L N. 

(M <, N means: M < N or M = N.) For if M C N, then either Case 
(la) or Case ( 2 ) applies with M as the N v 

The cardinal number of the empty set O we call 0. (Note: M' ~0 
onlyjf M' = O.) The cardinal number of any set N + {a} where at N we 
call A 7 + 1. (Note : For a given set N -f { a } with a £ N, a set M' ~ iV-J-{a} 
if and only if M' — N' + {a'} where a' ? N' and N' •—■ N.) 

Regarding the natural numbers 0, 1 , 2 , ...,«,»+ 1 , ... as a sequence 
of objects already known to us, the two definitions just stated correlate 
to each natural number n a respective cardinal number which we also 
write n. We call these cardinals finite cardinals, and sets which have these 
cardinals finite sets. The following two propositions will be proved in 
Example 1 § 7. 

(1) For each natural number n, the finite cardinal n is the cardinal of the 
set of the natural numbers which -precede the natural number n in the usual 
order of the natural numbers', or in symbols, n = {0, 1,2 ,..., n — 1}. 
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(2) If M — n (for a natural number n) and M ~ M x c M, then M x — M. 
Thus: A finite set is not equivalent to any proper subset of itself. 

From these two propositions it is not hard to show that the equality 
relation m = n and order relation m < n as determined for finite cardinals 
by the definitions of § 3 agree with the familiar equality and order relation 
for the natural numbers (in particular, we do have n < n + 1 for finite 
cardinals). Thus no confusion will result from identifying the natural 
numbers with the finite cardinals when we choose to do so. 

A set which is not finite we call infinite, and its cardinal an infinite or 
transfinite cardinal. The cardinal number of the set of the natural numbers, 
and therefore of every enumerably infinite set (§ 1), we call R 0 (read “alef 
null”). 

Corollary B. If n is a finite cardinal, n < R 0 . 

Proof. Because n is the cardinal of the subset {0, 1,2, ..., n — 1 } 
of the natural numbers, by Corollary A, n < H 0 . Assume, contrary to 
the corollary, that « = N 0 . But n + 1 is also a finite cardinal, so similarly 
n + 1 < R 0 , which with n = N 0 gives n + 1 <; n, contradicting n < n + 1 . 
Hence the assumption n = N 0 is untenable, and the remaining alternative 
n < R 0 is established. 

Theorem B. An infinite set M has an enumerably infinite subset. 

Proof. t M is not empty, since otherwise it would have the finite 
cardinal 0. Thus M has an element a 0 . Then M — {a 0 } is not empty, since 
otherwise M would have the finite cardinal 1. Thus M has another element 
a v Continuing thus, we select distinct elements a 0 , a,, a 2 , a 3 , ... corres¬ 
ponding to the natural numbers 0, 1, 2, 3, .... which proves the theorem. 
If P is the set M — {a 0 , a v a 2 , a 3 , ...} of the elements of M not selected, 

M = P + {a 0 , a v « 2 , a 3 , ...}. 

Corollary A. If M is an infinite cardinal, then R 0 < M. 

By the theorem with Corollary A Theorem A. 

Corollary B. An infinite set M is equivalent to a proper subset of 
itself. 

For M (expressed as above) is equivalent to its proper subset 
M — {a 0 } = P + {a v a 2 , a 3 , a,„ .. .}. 
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This with (2) above was proposed by Dedekind 1888 as an alternative 
definition of the distinction between finite and infinite sets. (We see that 
the property observed in Galileo’s “paradox” is characteristic of infinite 
sets.) 

Corollary C. The cardinal number of an infinite set M is unchanged by 
the introduction of a finite or enumerably infinite set of elements. 

For new elements b 0 , b v b z , b 3 , ... can be introduced thus, 

M -f- {fr 0 , b v b 3 , b 3 , ...} = P + {a 0 , b 0 , a v b v ..,}. 

Inversely, the corollary says that the removal of an enumerable set of 
elements from a set does not change the cardinal, provided the resulting 
set M is infinite. If the original set is non-enumerable, the resulting set 
must be infinite, as otherwise there would be an obvious enumeration of 
the original set. Thus: 

Corollary D. The cardinal number of a non-enumerable set is un¬ 
changed by the removal of a finite or enumerably infinite set of elements. 

*§ 5. Higher transfinite cardinals. The first of the theorems of 
this section is a formulation in general terms of the situation which we 
met in the last example of § 2 . The reader may also find it instructive to 
experiment with the theorem or its lemma for the case that M is a small 
finite set. The second of the theorems is a generalization of the situation 
encountered in Corollary B of Theorem A. 

We take advantage of the equivalence theorem, via its Corollary A, 
to simplify the presentation of the proofs. The theorems however can be 
proved, with only slight modifications in the argument, without the use of 
the equivalence theorem. 

Lemma A. If S is a set of subsets of M, and M ~ S, then there is a 
subset T of M which does not belong to S. 

Proof, by Cantor’s diagonal method. A subset of M is defined when it 
is determined which of the elements of M belong to the subset. This can 
be arranged by stating a general criterion, which, for any element m of M, 
determines whether that element belongs to the subset or does not belong 
to the subset. We now give a criterion of this sort to define T. 

Criterion. In the 1-1 correspondence given by the hypothesis 
M S, any element m of M corresponds to an element S of S. But S 
is one of the subsets of M. Therefore either m belongs to S, or m does not 
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belong to S. If m belongs to S, then m shall not belong to T. If m does not 
belong to S, then m shall belong to T. 

Now suppose, contrary to what is to be shown, that T belongs to S. 
Select that element of M, call it m 1 , which corresponds to T in the 1-1 
correspondence M ~ S. 

Does m 1 belong to T ? We apply the criterion, with as the m. Since 
Wj corresponds to T, the S of the criterion is now T. The criterion gives a 
contradiction, either if m 1 belongs to T, or if m i does not belong to T. 

The supposition that T belongs to S thus leads to absurdity. Hence, by 
the method of reductio ad absurdum (in which the negation of a proposition 
is proved by deducing a contradiction from the proposition), we conclude 
that T does not belong to S. 

If M is a given set, then the set of the subsets of M, i.e. the set of which 
the elements are (all) the subsets of M, is designated as UM (“U” from 
the German "Untermenge”). 

Theorem C. For any set M, M < UM- (Cantor’s theorem.) 

Proof. If N 1 is the set of the unit subsets of M, then M ~ N 1 C UM. 
Hence by Corollary A Theorem A, M — N x <, UM. Suppose, contrary 
to the theorem, that M — UM, i.e. M ~ UM. Then UM would satisfy 
the conditions for S in the lemma. By the lemma, there would be a subset 
T of M which does not belong to UM. This is absurd, since UM is the set 
of all the subsets of M. Therefore the remaining alternative M < UM 
must hold. 

If we take as the M of the theorem a set with the transfinite cardinal N 0 , 
we discover sets UM, UUM, ... which have greater and greater transfinite 

cardinals. These new cardinals are denoted by 2**°, 2 2 ** # ,_(In fact, for 

any set M, the cardinal of UM is denoted by 2 M . Note that this accords 
with the usual arithmetic when M is finite.) 

Lemma B. If Sis a set, and M is a set of subsets ofS, andto each member 
M of M there is another member M' of M such that M < M ', then M < S 
for every member M of M. 

Proof. Since M C S, by Corollary A Theorem A, M < 5. Assume, 
contrary to the lemma, that M = S. But similarly, M' < S, which with 
M — § gives M' < M, contradicting M < M' . Hence the assumption 
M — 5 is false, and the alternative M < 5 holds. 

If M is a set of which the members are sets, then the set of (all) the 
objects each of which belongs to some member M of M is called the sum 
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of the sets belonging to M and is designated by ©M. The set of the objects 
each of which belongs to every member M of M is called the intersection or 
product of the sets belonging to M and is designated by ®M (“®” from the 
German “Durchschnitt”). These notions are the same as were introduced 
in § 3 , except that now they are expressed as operations on the set M of 
the sets M which are added or multiplied. For example, M + N = 
©{M, N}, M -N = ${M, N}. 

Theorem D. If M is a set of sets, and if to each member M of M there 
is another member M' of M such that M < M', then M < ©M for every 
member M of M. 

Proof. From the definition of ©M, every element M of M is a subset 
of ©M. The theorem now follows from the lemma with ©M as the S of the 
lemma. 

By this theorem, the sum of the sets M, VIM, UUM, ... which have the 
increasing transfinite cardinals X 0 , 2 K », 2 2Xo , ... is a set having a trans- 
finite cardinal greater still than any of those cardinals. This set can be 
used by Theorem C to start a new ascending series. This hierarchy extends 
indefinitely. 

More will be found on Cantor’s theory of abstract sets in Cantor 1895-7, 
Hausdorff 1914 or 1927, or Fraenkel 1928 or 1953, for example. There is 
a cognate branch of the theory dealing with “ordinal numbers”. The 
“comparability theorem for cardinal numbers”, which asserts that 
M < N, M = N and M > N are exhaustive (end § 3 ), appears as a cor¬ 
ollary of the “well-ordering theorem” of Zermelo 1904 (cf. e.g. Hausdorff 
1914 or 1927 p. 61 , or Fraenkel 1928 p. 205 ). For a brief account of the 
celebrated "continuum problem”, which deals with the question whether 
any cardinal lies between N 0 and 2 X », see Godel 1947. 

We have begun with Cantor’s theory for two quite opposite reasons. 
First, some of the ideas and methods which will prove basic later appear 
in it in their original and simplest form. Second, the theory, pursued too 
far, reveals logical difficulties, which are a point of departure for our 
main investigation. This will appear in Chapter III. 

Examples. Sets of cardinal 2N». This is the cardinal assigned to the 
set of the subsets of the set of the natural numbers, which we described 
in § 2 as the set of the sets of natural numbers. There we represented the 
elements of the set by the infinite sequences of 0 ’s and 1 's. The 0 ’s and 
l’s can be interpreted as the digits in a dual (or dyadic) number system, 
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i.e. a number system based on 2 as the decimal system is based on 10, so 
that we have the set of the proper dual fractions. Using Theorem B 
Corollary D to withdraw the terminating fractions, which are enumerable, 
we obtain the proper non-terminating dual fractions. These represent 
1-1 the real numbers x in the interval 0 < x < 1. From the proper 
non-terminating dual fractions we also obtain 1-1 the infinite se¬ 
quences of natural numbers or functions of a natural number taking 
a natural number as value, by coordinating to a fraction that func¬ 
tion f(n) for which /(0) = the number of 0’s before the first 1 in the 
fraction, /(1) = the number of 0’s between the first 1 and the second 1, 
and so on (for example, the function n 2 corresponds to the fraction 
.101000010000000001...). 

Now omit from the interval 0 < ^ < 1 the number x — 1 , leaving the 
real numbers x in the interval 0 < x < 1. A function y = f(x) is easily 
found which, as x ranges over this interval, takes as value y exactly once 
each of the real numbers, e.g. the function y = cot tzx. Removing the 
rational numbers, the (real) irrational numbers are left; or removing 
the algebraic numbers, the transcendental numbers. In Cartesian 
coordinate geometry, the real numbers are coordinated to the points 
of the real Euclidean line. This set is the ‘linear continuum’, and ac¬ 
cordingly the cardinal 2**° is the ‘power of the continuum’. 

Next we can proceed as follows to obtain the set of the ordered 
pairs of real numbers, or regarding a pair (x, y) as Cartesian coor¬ 
dinates in the plane, the points of the real Euclidean plane. Under the 
equivalence already obtained between the real numbers and the infinite 
sequences of 0’s and 1 ’s, any two real numbers x, y correspond respectively 
to sequences of 0’s and 1 ’s 

x 0 x x x z x 3 ..., 
yo yi y 2 y& * *.» 

which can be combined into a single sequence 

*o y<> *i yi x 2 y 2 *3 y 3 ---> 

corresponding to a single real number. Conversely, any single sequence 
breaks up into a determinate pair of sequences under this method of 
combination. Similar procedure gives the n-tuples of real numbers or the 
points of real Euclidean n-dimensional space for any fixed positive integer 
n, and even the infinite sequences of real numbers or the points of real 
EaCltdean N 0 - dimensional space. This last example is treated by using the 
method of § 1 to combine N 0 sequences of 0’s and 1 ’s 
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into a single sequence 

*00 *10 *01 *02 *11 *20 *30 *21 X 12 X 03 • • • 

in which each member ot each of the given sequences has a determinate 
position. 

For any one of the real continuous functions of a real variable, 
all the values of the function are determined by the continuity property 
as soon as the values are given for the rational values of the independent 
variable. These values can be given as an infinite sequence of real numbers, 
by following the order of the rational numbers in some fixed enumeration 
of the latter. Therefore by Theorem A Corollary A, the set of these func¬ 
tions has at most the cardinal 2 X ». But also it must have at least this 
cardinal, and hence exactly this cardinal, since the constant functions 
constitute a subset with the cardinal. 

Sets of cardinal 2 2 **\ This is the cardinal of the sets of sets of 
natural numbers. From the equivalence between the sets of natural 
numbers and the real numbers or the points in ^-dimensional or No- 
dimensional space, it follows that the sets of real numbers and the point 
sets in real Euclidean n-dimensional or N 0 - dimensional space have this 
cardinal. The real functions of a real variable can be represented by their 
graphs, which are point sets in the plane, and hence the set of them has 
at most the cardinal 2 2 *°. It has exactly this cardinal, since those of the 
functions which take only 0 and 1 as values are the representing func¬ 
tions of the sets of real numbers, and so constitute a subset with the 
cardinal. If we extend geometric terminology to this example, we have 
the set of the points of real Euclidean 2^-dimensional space. 



Chapter II 


SOME FUNDAMENTAL CONCEPTS 

§ 6. The natural numbers. The purpose of this chapter is to bring 
together, partly for reference and partly for closer inspection, some of the 
ideas and methods of mathematics. 

When we write the natural number sequence 

0 , 1 , 2 , 3 , ..., 

we rely on the dots “... ” to suggest the continuation of the sequence 
beyond the several members shown. 

Kronecker remarked ( 1886 ), “God made the integers, all the rest is the 
work of man.” We cannot expect that the cognizance of the natural 
number sequence can be reduced to that of anything essentially more 
primitive than itself. 

But by elaborating upon what our conception of it comprises, we may 
succeed in making clearer the bases of our reasoning with the natural 
numbers. 

We begin by describing the natural numbers as the objects which can 
be generated by starting with an initial object 0 {zero) and successively 
passing from an object n already generated to another object n + 1 or n' 
(the successor of «). 

Here we conceive of it as possible, no matter how far we have already 
gone to reach n, to go the one step further to reach n'. The use of the accent 
notation “ri” instead of the more familiar “n + 1” emphasizes that ' is 
a primitive unary operation or function used in generating the natural 
numbers, while + can be defined at a later stage as a binary operation or 
function of two natural numbers. 

To obtain the natural numbers with the usual notations, it remains 
only to explain 0 , 1 , 2 , 3 , ... as standing for 

0, O', 0”, 0'”,..., 

respectively. This is a matter of detail concerning decimal notation. 

In the foregoing description, we have evoked the conception of a 
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succession of discrete steps. These consist in starting with 0 , and proceed¬ 
ing repeatedly from a number n to the next n'. The description can be 
broken into several clauses, as follows. 

1 . 0 is a natural number. 2 . If n is a natural number, then n' is a natural 
number. 3 . The only natural numbers are those given by 1 and 2. 

In this format, the succession of discrete steps becomes an application 
of Clause 1 and a succession of applications of Clause 2. The three clauses 
together constitute an example of what we call an inductive definition. 
The term (‘natural number’) which is being defined is italicized. The 
clauses except the last, which provide instances of the term being defined, 
are called direct clauses', the last clause, which says that the only in¬ 
stances are those provided by the preceding clauses, is the extremal clause. 

Not stated in this inductive definition is the condition for distinctness, 
namely that numbers generated by applications of Clauses 1 and 2 in 
distinct ways should be distinct objects. This can be separated into two 
further propositions. 

4 . For any natural numbers m and n, m' = n' only if m = n. 5 . For any 
natural number n, n' ^ 0. 

Also it is understood that ' is a univalent operator or single-valued 
function, so that conversely to 4 : For any natural numbers m and n, 
m' = «' if m = n. 

To see that Propositions 4 and 5 do require the distinctness of every 
two differently generated numbers, we can reason as follows. Suppose 
that at a given stage in the generation of the numbers, all the numbers 
0 , 1 , .. n so far generated are distinct. Then the next one generated n’ 
must be distinct from the successors 1, ..., n among those previously 
generated (by 4 ) and from 0 (by 5 ). So each successive step in the gener¬ 
ation produces a new number. 

For example, 0 "" # 0", as may also be seen thus. By 4 applied with 
O'" as the m and 0 ' as the n, 0 "" = 0 " only if O'" = O'. By 4 again, 
O'" = 0 ' only if 0 " = 0 . But by 5 with 0 ' as the n, 0 " ^ 0 . 

These five propositions 1 — 5 , with one difference, were taken by 
Peano (1889,1891*) as axioms characterizing the natural number sequence. 
Peano stated Proposition 3 instead as the principle of mathematical 
induction (§ 7 ), and placed it fifth on his list, 4 and 5 being moved up to 
third and fourth, respectively. 

Here we are not considering what the natural numbers are intrinsically, 
but only how they form the natural number sequence. A particular nat¬ 
ural number is to be recognized as the object occupying a particular 
place in the sequence. In other words, a particular number is given when 
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its generation under the inductive definition is given. For example, the 
natural number 4 is given as that object which is obtained by starting 
with the initial object 0 and applying the successor operation ' once, 
again, again and again; or briefly, 4 is given as 0 '"'. A number such as 
872656 in decimal notation could in principle be exhibited by applications 
of' to 0, though in practice we do not do so. 

Of course when we deal with propositions such as that a certain equation 
has two roots, we further employ the assignment of the natural numbers 
as cardinal numbers of finite sets (§ 4 ). 

Order. Under the inductive definition of the natural numbers, they 
are generated in a certain order (the familiar one). Thus we define m 
to be < n, if m is generated before n in the course of generating n. Dis¬ 
secting this, we have the following inductive definition of the relation 
m < n (where m, n range over the natural numbers). 

01 . m < m'. 02 . If tn < n, then m < n'. 03 . m < n only as required 
by 01 and 02. 

When this definition is read, for a fixed m, as an inductive definition 
of the class of the numbers n greater than m, it has the form of the original 
inductive definition of the natural numbers, with m' replacing 0. 

§ 7 . Mathematical induction. Let P be a property of natural 
numbers. Suppose that: 

(1) 0 has the property P. 

(2) If any natural number n has the property P, then its successor 
«' has the property P. 

Then: Every natural number n has the property P. 

This is the principle of mathematical induction. We can state it a little 
more briefly using “n” as a natural number variable and “P(n)” as a 
notation for the proposition that n has the property P: If (1) P( 0 ), and 
(2) for all n, if P(n) then P{n'), then, for all n, P{n). 

The justification of the induction principle is almost immediate, when 
the natural numbers are conceived as the objects generated under the 
inductive definition 1— 3 of § 6. Suppose we have a property P for which 
(1) and (2) hold. Must then every natural number n have the property P? 
We interpret an affirmative answer to mean simply that, if any natural 
number n were given to us, we could be sure that that n has the property 
P. But a natural number n is given precisely when (actually or in principle) 
we are given its generation under the inductive definition, by starting 
with 0 and applying an exhibited number of times the successor operation '. 
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Under these circumstances, we can use (1) and ( 2 ) to conclude that n, 
has the property P. For example, P( 4 ) holds because 4 is given as 0'"'; 
by (1), P( 0 ); thence by ( 2 ), P( O'); by (2) again, P( 0 "); by ( 2 ) again, 
P(0"'); and by (2) again, P( 0 ""). 

Otherwise expressed, (1) and (2) are tools which enable us, while we 
are generating the natural numbers by Clauses 1 and 2 of the inductive 
definition, at the same time to verify for each number as we generate it 
that it has the property P. 

This reasoning of course depends on the extremal clause 3 of the in¬ 
ductive definition. Conversely, the principle of induction can be used to 
prove Clause 3 , by applying it with the following proposition as the P(n): 
n is given as a natural number by Clauses 1 and 2 , i.e. can be generated by 
starting with 0 and applying the successor operation 

In connection with a proof by mathematical induction, we use the 
following terminology. The proposition P(») depending on a variable 
natural number n we call the induction proposition ; and the variable n 
the induction variable or induction number or the variable on which the 
induction takes place. The part of the proof which consists in establishing 
(1), i.e. the proof that P(0), we call the basis of the induction. The part 
which consists in establishing (2), i.e. the proof that if P(n) then P(«')> 
we call the induction step. Within the induction step, the assumption 
P(«), from which we deduce Pin'), we call the hypothesis of the induction. 

Sometimes, in order to carry through the induction step, it is necessary 
to assume as hypothesis of the induction, not simply P(n) but that P(m) 
for all m <. n. The reader may satisfy himself that the induction principle 
is valid in this modification, called a course-of-values induction. Induction 
may be applied to the proof of a proposition depending on a positive 
integer instead of a natural number, in which case the basis consists in 
proving P(l). 

The student encounters mathematical induction in elementary algebra 
courses. Formulas for summing progressions are often given as examples 
of propositions to be proved by induction which are not obvious before 
the proofs have been given. Many propositions which we commonly take 
for granted depend on induction when explicitly proved; and in other 
cases an induction step is so simple that it is passed off with the phrase 
“and so on” or the like (e.g. Theorems A and B § 4 ). 

Example 1 . Prove the propositions (1) and (2) of § 4 by induction 
on n. We do so for (2), leaving (1) to the reader. The induction proposition 
is: For any sets M and M v if M — n and M ~ c M, then M 1 = M. 
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Basis : n — 0 . Let M and M x be sets such that M = 0 , i.e. M — O, and 
O ~M X c O. Then M x = O. Induction step. Assume the induction 
proposition as stated (as hypothesis of the induction). Now let M and M x 
be sets such that M = n 1, i.e. M — N + {«} where N — n and a IN, 
and N + {a} ~M 1 c N + {a}. We must prove that then M x — N -f- {a}. 
In the given 1-1 correspondence N -f {a} ~Mj, the element a of 
N + {«} corresponds to some element b of M x . Then N ~ M x — {6} c 
(iV + {a}) — {6}. Moreover (N + {«}) — {&} ~ N. Hence (A + {a}) —{6} 
= n and (N {a}) — {6} ~ M x — {6} c (N + {a}) — {b}. By the 
hypothesis of the induction, applied using (N + {a}) — {6} as the M and 
M x — {6} as the M x , M x — {b} = (N + {a}) — {6}. Hence (since b £ M x 
and b 8 N + {a}), M x — N + {a}. 

Example 2 . In mathematical formulas parentheses are introduced in 
pairs to show which way the parts of the formula should be associated. 
In complicated cases different species of parentheses such as ( ), { }, [ ] 
may be employed; and very complicated cases may be avoided by various 
abbreviations. However, the question exists in principle whether, using 
one species of parentheses only, the association of a formula is unam¬ 
biguously fixed by its parentheses. (The question is equivalent to a ge¬ 
ometrical one concerning nesting of intervals.) 

To make the question precise, suppose we have 2 n parentheses, n of 
them being left parentheses (, and n of them right parentheses ), and that 
they occur in linear order from left to right. This is the way they would 
occur in a mathematical formula, the other symbols of the formula which 
we need not notice now being interspersed among them. 

We say that two pairs of parentheses separate each other, if they 
occur in the order ( { { } )* ),-, where the i’s identify either pair and the j ’s 
the other, and other parentheses may occur interspersed among the four 
shown. 

We define a 1-1 pairing of the n left parentheses with the n right 
parentheses (briefly, a pairing of the 2 n parentheses) to be proper, if a left 
parenthesis is always paired with a right parenthesis to the right of it, 
and if no two of the pairs separate each other. 

It is almost immediate that if 2 n parentheses are properly paired, on 
removing any of the pairs, the remaining parentheses are properly paired. 
Also the parentheses included between a given pair in the proper pairing 
of the 2 n parentheses are properly paired. 

The following three lemmas contain the answer to the proposed question 
and some related information. 
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Lemma 1 . A proper pairing of 2 n parentheses (n > 0) contains an 
innermost pair, i.e. a pair which includes no other of the parentheses between 
them. 

Prove by a course-of-values induction on ». One may choose to consider 
n either a positive integer or a natural number. If the latter, the basis is 
vacuously true, i.e. true because its hypothesis is not satisfied. (Hint: 
Under the induction step, the leftmost parenthesis will be a left paren¬ 
thesis (,, and this with its mate ) f either themselves constitute an innermost 
pair, or else include a set of parentheses lying between them to which 
the hypothesis of the induction can be applied.) 

Lemma 2. A set of 2 n parentheses admits at most one proper pairing. 


Prove by a (simple) induction on n. (Hint: Under the induction step, 
by Lemma 1 the given parentheses contain an innermost pair. With¬ 
drawing this, the hypothesis of the induction applies to the set of the pa¬ 
rentheses remaining.) 


Lemma 3. If 2 n parentheses and a consecutive subset of 2 m of them both 
admit proper pairings, then the proper pairing in the subset forms a part of 
the proper pairing in the whole set, i.e. each parenthesis of the subset has the 
same mate in both pairings. 


Prove by induction on m. 

For illustration, consider the 22 parentheses 
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A proper pairing, indicated by the subscripts, is discovered by the fol¬ 
lowing 'algorithm' (suggested by the proof of Lemma 2 ): at each stage, 
proceeding from the left, search out the first innermost pair among those 
not already used, and let this pair belong to the pairing. By Lemma 2, it 
would be futile to search for any other proper pairing than this. The third 
to the twelfth parentheses are a consecutive subset, in which a proper 
pairing has already been established in the process of pairing the whole 
set. By Lemma 3 , it would be futile to search for any consecutive subset 
admitting a proper pairing other than one of those already properly paired 
in pairing the whole. 


§ 8. Systems of objects. By a system 5 of objects we mean a (non¬ 
empty) set or class or domain D (or possibly several such sets) of objects 
among which are established certain relationships. 

For example, the natural number sequence (§ 6) constitutes a system of 
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the type ( D , 0 ,') where D is a set, 0 is a member of the set D, and ' a unary 
operation on a member of the set D. Another simple type of system is 
(D, <) where D is a set and < is a binary relation between members of 
the set. 

When the objects of the system are known only through the relation¬ 
ships of the system, the system is abstract. What is established in this 
case is the structure of the system, and what the objects are, in any 
respects other than how they fit into the structure, is left unspecified. 

Then any further specification of what the objects are gives a rep¬ 
resentation (or model) of the abstract system, i.e. a system of objects 
which satisfy the relationships of the abstract system and have some 
further status as well. These objects are not necessarily more con¬ 
crete, as they may be chosen from some other abstract system (or even 
from the same one under a reinterpretation of the relationships). 

Several representations of the abstract natural number sequence are 
(a) the natural numbers as cardinals of finite sets, (b) the positive integers 
(1 representing the abstract 0), (c) the even natural numbers (+2 re¬ 
presenting the abstract '). (d) Commercial products are sometimes pack¬ 
aged in containers which carry advertizing matter including a picture of 
the container itself. Physically, the picture must be limited in accuracy. 
But if we suppose perfect accuracy, we can represent 0 by the container, 
1 by the picture of the container on the container, 2 by the picture of 
the container in the picture of the container on the container, and so on. 

Two representations of the same abstract system are {simply) isomorphic, 
i.e. can be put into a 1-1 correspondence preserving the relationships. 
More precisely, two systems (D lt 0 1( 'j) and (Z) 2 , 0 2 , ' 2 ) of the type ( D , 0,') 
are simply isomorphic, if there exists a 1-1 correspondence between 
D x and D 2 such that 0 X corresponds to 0 2 (in symbols, 0 X —>- 0 2 ), and 
whenever m x * — m 2 then m x \ <—► m 2 Two systems (D x , < x ) and 
iPv <t) of the type (D, <) are simply isomorphic, if there exists a 1-1 
correspondence between D x and D 2 such that, if m x < —s>- m 2 and n x <—► n 2 
then: m x < x n x if and only if m 2 < 2 n 2 . 

Conversely, any two simply isomorphic systems constitute represen¬ 
tations of the same abstract system, which is obtained by abstracting 
from either of them, i.e. by leaving out of account all relationships and 
properties except the ones to be considered for the abstract system. 

As a second example of an abstract system of the type ( D , 0, '), let 
D have just two (distinct) objects 0 and 1, and let 0 ' = 1 and 1' = 0. 
We call this system the residues modulo 2. The natural number sequence 
becomes this when each number is replaced by its remainder after division 
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by 2 (i.e. is reduced mod 2), thus, 

0 , 1 , 0 , 1 , 0 , 1 . 

(Systems of residues were first considered by Gauss, 1801 .) 

As a third example, let S consist of two sequences 

0, 1, 2, 3 , to, co -j- 1, co -f- 2, co -f- 3, .. ., 

each by itself of the same structure as the natural numbers, and with 
no member of either sequence in the relation of successor to a member 
of the other. 

We can modify each of these three examples obviously to consider it 
as a system of the type ( D , <). In the third example, we then take the 
elements in the order shown; and we call them the ordinals < 2co (from 
Cantor’s theory of ordinal numbers). 

The residues mod 2 (or a representation of them) are not isomorphic 
with the natural numbers (or a representation of them), since it is im¬ 
possible to establish a 1-1 correspondence. The ordinals < 2co are not 
isomorphic with the natural numbers, for it is not possible in establishing 
a 1-1 correspondence to preserve the successor operation ' (or the order 
relation <). 

In this section, we are writing “S” for a system and “D” for its set of 
objects, in the case of systems having one set. The notation can often be 
simplified without confusion to use one letter for both. It can e.g. in 
speaking of the natural numbers N as above. It cannot e.g. in speaking of 
the system ( N , <) consisting of the natural numbers, with the even 
numbers (odd numbers) ordered among themselves as usual, and all the 
even numbers preceding all the odd numbers. (This system is a repre¬ 
sentation of the ordinals < 2o>.) 

Systems of objects are introduced in mathematics under two con¬ 
trasting methods or points of view (cf. Hilbert 1900). 

The genetic or constructive method is illustrated by the inductive defi¬ 
nition of the natural numbers (§6). There we conceived of the natural 
numbers as being generated or constructed in a certain orderly manner. 
(This did not prevent our treating them abstractly.) 

In the axiomatic or postulational method, on the other hand, some 
propositions, called axioms or postulates, are put down at the outset as 
assumptions or conditions on a system S of objects. The consequences of 
the axioms are then developed as a theory about any existing system S 
of objects which satisfies the axioms. 

To illustrate, we can take as the axioms Peano’s five axioms. To 



§8 


SYSTEMS OF OBJECTS 


27 


make the point clear, let us rewrite the Peano axioms substituting 
“member of D” for “natural number”. 

PI. 0 6 D. P2. If n&D, then n'ZD. P 3 . If mZD and n £ D, then 
m' — n' only if m — n. P 4 . If n S D, then n' ^ 0. P 5 . If P C D and 
( 1 ) OS P and (2) whenever n £ P then n' £ P, then P — D. 

We already know that exactly one abstract system S satisfies these 
five axioms, namely the natural numbers which we previously introduced 
from the genetic standpoint. 

But from the axiomatic standpoint, we can equally well consider other 
lists of axioms, for example Pi—P 4 . Then S can be the natural numbers, 
or the ordinals < 2co, or any one of many other abstractly differing, i.e. 
non-isomorphic, systems. 

If instead the axioms are Pi —P 3 , P 5 , then the different abstract 
systems which satisfy are precisely the natural numbers and the systems 
of residues mod m for each positive integer m. 

Next suppose we not merely remove P 4 , but substitute for it: 

P6. If n £ D, then n' # n but n" = n. 

Now again exactly one abstract system satisfies, the residues mod 2 . 

The six axioms Pi—P6 together are satisfied by no system S at all, 
since only the natural numbers satisfy Pi—P 5 and only the residues mod 
2 satisfy Pi—P 3 , P 5 , P6. 

The axioms of an axiomatic theory are sometimes said to constitute 
an implicit definition of the system of the objects of the theory; but this 
can only mean that the axioms determine to which systems, defined from 
outside the theory, the theory applies. Then three cases arise. The axioms 
may be satisfied by no system of objects (e.g. Pi—P6); or by exactly 
one abstract system, any two systems which satisfy being isomorphic 
(e.g. Pi—P 5 , or Pi—P 3 , P 5 , P6); or by more than one abstract system, 
i.e. non-isomorphic systems exist which satisfy (e.g. PI—P 4 , or PI—P 3 , 
P 5 ). In the first case we may call the set of axioms vacuous ; in the other 
two non-vacuous, and furthermore in the second categorical (Veblen 1904) 
and in the third ambiguous. (In the genetic method, on the other hand, the 
generation process is ordinarily intended to determine the abstract 
structure of the system completely, i.e. to constitute a categorical def¬ 
inition of the system.) 

It may be by no means evident for a given axiomatic theory which 
of the three possibilities is the case. This is illustrated historically by the 
example of Euclidean geometry without Euclid’s parallel postulate, on 
which depends the theorem that through a given point not on a given line 
there passes exactly one line parallel to the given line. From Euclid’s 
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“Elements” (c. 330—320 B.C.) until the discovery of a non-Euclidean geome¬ 
try by Lobatchevsky ( 1829 ) and Bolyai ( 1833 ), it was generally supposed 
that the axioms are categorical; or at least if the question had been asked 
in these terms, it would probably have been so answered. 

The Greek’s belief that they were dealing with a unique structure of 
space was not formulated in the present terminology. Euclid thought of 
his axioms as expressing certain fundamental properties of real space. 
The axiomatic method in this older sense, wherein the objects of the system 
S are supposed to be known prior to the axioms, may be distinguished as 
informal or material axiomatics. In this, the axioms merely express those 
properties of the objects which are being taken initially as evident from 
their construction, or in the case of theories applying to the empirical 
world as abstracted directly from experience or as postulated about 
that world. 

The axiomatic method as described above, wherein the axioms are 
prior to any specification of the system 5 of objects which the axioms 
are about (and serve to introduce or “define implicitly” the S), was 
first developed systematically in Hilbert’s "Grundlagen der Geometrie 
(Foundations of Geometry)” (1899), and may be distinguished as formal 
or existential axiomatics. We note that it is only from outside a formal 
axiomatic theory (i.e. in some other theory) that one can investigate 
whether one, or more than one, or no abstract system S satisfies the 
axioms. Within the formal axiomatic theory, the domain D for S plays 
the role of a fixed and completed set of objects, assumed as existing all 
at once apart from any order of generation, to which the operations, 
relations, etc. of S apply. 

For a system S of the type ( D , 0,'), 0 and ', or D, 0 and ', are then 
called the primitive or technical or undefined notions, i.e. they are un¬ 
defined prior to the introduction of the axioms. The other terms in the 
axioms are ordinary or logical or defined, i.e. their meanings must be pre¬ 
viously understood. About D, 0 and ', it has only to be understood in 
advance that D is a set, 0 an object belonging to D, and ' an operation on 
a member of D ; i.e. only the grammatical categories to which “D”, “0” 
and “ ' ” belong are defined in advance. Similarly for a system of the form 
( D , <), the undefined notions are <, or D and <. 

In mathematical practice there is often an interplay between the genetic 
and axiomatic methods of introducing systems of objects, as when an 
example of a system of objects satisfying the axioms is provided genet¬ 
ically. At other times an example may be drawn from another formal 
axiomatic theory. (In either case, as soon as the S for a given formal 
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axiomatic theory is identified with a system of objects provided from 
outside the theory, we have an application of the formal axiomatic theory, 
in which application it becomes a material axiomatic theory.) 

The formal axiomatic method is often used to advantage with am¬ 
biguous axiom systems, so as to develop simultaneously a common 
portion of theory for many different systems. The example of ‘groups’ 
in algebra is celebrated. 

As another example, consider the following axioms for linear order, 
which apply to systems of the type (D, <). 

Ll. If m < n and n <p, then m < p. L2. At most one of m < n, 
m—n and m>n holds. L 3 . At least one of m < n, m~n and m > n holds. 

Here m > n means n < m. The variables m, n, p refer to any elements 
of D. These axioms are satisfied taking as D the natural numbers, the 
ordinals < 2co, the integers, the rational numbers, or the real numbers, 
and as < the usual order relation for the same; and by many other 
systems. Omitting L 3 , we have a set of axioms for partial order. 

*§ 9. Number theory vs. analysis. Arithmetic or number theory 
may be described as the branch of mathematics which deals with the 
natural numbers and other (categorically defined) enumerable systems 
of objects, such as the integers or the rational numbers. A particular such 
system (or the theory of it) may be called an arithmetic. The treatment 
is usually abstract (§ 8). The objects are usually treated as individuals 
(i.e. they are not analyzed as composed out of other objects), except e.g. 
when the fundamental properties of non-negative rational numbers are 
being developed by representing them as ordered pairs of natural numbers. 

In arithmetic in the narrower sense one is mainly concerned with par¬ 
ticular operations called + (addition) and (multiplication), or also a few 
other related operations. In arithmetic in the wider sense or number theory 
a wider fund of concepts is employed. 

These definitions are given to clarify our terminology. Sometimes 
"arithmetic” is encountered referring to the theory of + and • for systems 
of numbers that are not enumerable (e.g. the ‘arithmetic of transfinite 
cardinals’). 

While the cardinal numbers of the systems studied in arithmetic or 
number theory are No (or sometimes finite), analysis on the other hand 
deals with the real numbers and other systems of objects having the car¬ 
dinal 28 o (or sometimes a higher cardinal). As with number theory, the 
systems of objects employed in analysis are usually taken to be cate¬ 
gorically determined. 
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Results of analysis are sometimes applied in number-theoretic in¬ 
vestigations, which then constitute analytic number theory. Number 
theory without help from analysis is pure or elementary number theory. 

We now examine briefly the fundamental system of objects for analysis, 
namely the continuum of the real numbers. 

The theory of real numbers which is currently used as the basis for 
analysis (except by critics of its foundations) is the product of an earlier 
critical movement initiated by Gauss ( 1777 — 1855 ), Cauchy ( 1789 — 
1857 ) and Abel ( 1802 — 1829 ). ' 

This led late in the nineteenth century to the arithmetization of analysis, 
so called, by Weierstrass ( 1815 — 1897 ), Dedekind ( 1831 — 1916 ), Meray 
( 1835 — 1911 ) and Cantor ( 1845 — 1918 ). Reliance on somewhat vague 
geometrical intuitions was replaced by a definition of the real numbers 
as certain objects constructed out of natural numbers, integers or rational 
numbers. The properties of the real numbers were thereby reduced ul¬ 
timately to properties of natural numbers. As Poincar6 said in 1900, 
“Today there remain in analysis only integers or finite or infinite systems 
of integers, interrelated by a net of relations of equality or inequality.'' 

The definition of the real numbers from natural numbers, integers 
or rationals can be given in several ways. All lead to the same abstract 
structure of the real number continuum. In other words, what each of 
the definitions accomplishes is to provide a representation (§ 8) of the real 
numbers by objects constructed (directly or indirectly) out of natural 
numbers. 

We have used the representations by the infinite decimal or dual fractions 
(§§ 2 , 5 ). In principle any one of the sets proved equivalent to these (§ 5 ), 
e.g. the sets of natural numbers, could be used, but in practice one will 
choose a representation which makes it simple to define the properties 
of the real numbers. 

A representation which makes the ordering of the real numbers es¬ 
pecially perspicuous is that by Dedekind cuts (1872). Suppose the rational 
numbers R have been separated into two non-empty classes X v X 2 such 
that every rational in X x is < every rational in X 2 . Such a separation is 
called a Dedekind cut (in R). In case there is neither a greatest rational in 
the lower set X x nor a least in the upper X 2 , the cut is called open. Dede¬ 
kind’s insight was that irrationals are called for exactly where the open 
cuts occur. A rational goes with either of two closed cuts, one for which it 
is the greatest in X v and the other for which it is the least in X 2 . In order 
to have a unique representative of each real number (rational or irra¬ 
tional), we can use the lower sets X 1 of the cuts for which X x has no great- 
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est. This gives us the following definition (writing x in place of X v and 
R — x in place of X 2 ). 

A real number is a set x of rationals such that: 

(a) Neither x nor R — x is empty, (b) x contains no greatest rational, 
(c) Every rational in x is < every rational in R — x. 

The set C of reals is the set of all such sets x of rationals. 

This definition makes use of the presupposed system R of rationals 
to construct the representatives of the reals, without taking R into the 
resulting system C as a subsystem. (If the members of R are individuals, 
the members of C are sets of those individuals.) 

We now define a real number x to be rational, if R — x has a least 
member x, in which case x is said to correspond to the rational x (of the 
system R). Otherwise, x is irrational. 

The rationals among the reals form a subsystem C R of C which is 
isomorphic (§ 8) to the original system R of rationals, as we verify each 
time we use the representation to define a notion for the reals which 
has previously been defined for the rationals. 

Examples. The real 2 is the set of the rationals < the rational 2, 
to which it corresponds. The real \/2 is the set of the rationals which are 
either negative or have squares < the rational 2 (among which there is 
no greatest). Since the square of no rational — 2 (as Pythagoras discovered 
in the sixth century B.C.), R — y /2 consists of the positive rationals 
having squares > 2 (among which there is no least), so y /2 is irrational. 

The order relation for reals is defined thus: x < y, if there exists a 
rational r which is in y but not in x. (Now prove that C is linearly ordered 
by <, and that (C R , <) is isomorphic to (R, <).) 

A real number v is an upper bound of a set M of real numbers, if v > x 
for every real number x belonging to M. 

(A) If a non-vacuous set M of real numbers has an upper bound, it has 
a least upper bound u (= l.u.b. M). 

Proof. We must construct u as a set of rationals having Properties 
(a) — (c). We are given M as a set of such sets of rationals. The definition 
of the set u is this: a rational rZ u when and only when, for some real 
number x which £ M, r £ x. In the symbolism of § 5 , u = ©M. It is left 
to the reader to show now that u = l.u.b. M. (Prove that u is a real 
number, u is an upper bound of M, and M has no upper bound v < u.) 

Lower bounds are defined similarly. If the real x is rational, let 
x = x -f- {%}; otherwise, let x = x. Let — x be the set of the rationals 
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-r for r£i?-x. (If x is rational, then — x corresponds to — x.) 
Let — M be the set of the reals — x for x8 M. If w is a lower bound of 
M, then — w is an upper bound of — M, so — M has a l.u.b., and 
- (l.u.b. - M) = g.l.b. M. 

Given reals x and y, let x + y be the set of the rationals r -f- s for 
r 8 x and s 8 y ; let x — y = x + (— y) ; and let |x| = x if x > 0 and 
|x| = — x if x < 0. (Do not confuse + and — with addition and 
subtraction of sets, which are written -f- and —.) 

Given an infinite sequence a 0 , a v ..., a n , ... of reals and a real 
a, we say that lim a n = a, if for every real e > 0, there is a natural 
number n t such that, for every n > » e , |a n — a| < e. For example, 
lim l/2 n = 0 (where l/2 n is the real corresponding to the rational l/2 n ). 

(B) If u = l.u.b. M (as in (A)), there exists a sequence a 0 , aj, ..., 
a„, ... of members of M such that lim a n = u. 

Proof. Let M n = the set of the reals which 8 M and are > u — 1 / 2 ”. 
(Prove that M n is not empty.) Let a n be any real chosen from M„. 
(Prove that lim a n = u.) 

Notwithstanding that in this theory analysis is “arithmetized”, the 
distinction between arithmetic and analysis remains sharp, in that 
analysis finds it necessary to employ infinite sets of the objects of arithme¬ 
tic as its objects. 

§ 10 . Functions. In the most general sense, a (single-valued) function 
f or f(x) or y — f(x) of one variable * is a correspondence by which, to each 
element x of a set X there corresponds a single element y of a set Y. 

The set X is the range of the independent variable, or the domain of the 
function. The function may be called a function from X to Y (or a function 
of a member of X taking a member of Y as value, or an operation on a 
member of X producing a member of Y, etc.). 

The range of the dependent variable y or f(x ) is the subset Y 1 of Y 
comprising the elements of Y used in the correspondence, i.e. those which 
correspond by the function / to some element of X. Then X and Y x 
are in many-one correspondence, since to each element of X there cor¬ 
responds just one element of Y x , but an element of Y x will (in general) 
correspond to many elements of X. An element x of X is an argument of 
the function or a value of the independent variable. The corresponding 
element y of Y is the corresponding value of the function or of the dependent 
variable, or the value of the function for that argument. (Sometimes “ar¬ 
gument” is encountered meaning “independent variable”.) 
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A (single-valued) function f or f(x 1 , ...,x n ) or y — f(x v of 

n variables x lt .... x n is a correspondence by which, to each ordered 
«-tuple (x v ..x n ) of objects where x x C X v x 2 & X 2 , ..., x n £ X n , there 
corresponds a single object y where y £ Y. A function of n variables can 
be considered as a function of one variable, with X as the class of all the 
ordered w-tuples (x v ..., x n ). Similar terminology applies. Thus X x is 
the range of x v X 2 of x 2 , ..., X n of x n . Here X lt X 2 , ..X n may all be 
the same set, or there may be several (up to n) different ranges. A par¬ 
ticular sequence x x , ..., x n of elements from X v .... X n , respectively, is 
a set (or n-tuple) of arguments. 

In this plethora of terminology, one may recognize a mixture of 
terminologies based on two ideas: the idea of a function as a many-one 
correspondence, and the idea of a function as a variable y which ranges 
in relation to another variable x so that the value of y is always fixed by 
that of x. 

The first idea is the more comprehensive one, which the student 
should keep uppermost in his mind. The second idea however gives rise 
m a natural way to the useful notational convention, whereby if “f{x)” 
for instance stands for a certain function of the independent variable 
x, and a, b, etc. are values of the independent variable (i.e. arguments), 
then "f(a)” stands for the value of the function for the argument a, 
“f{b)” for the value when x = b, etc. 

One should be aware that then “f(x)” may have either of two meanings: 
1 . The function itself (i.e. the many-one correspondence between X and 
Yj). 2. When x stands for an object from the domain, the corresponding 
value of the function (i.e. a member y of Y x ). When x is unspecified, the 
latter is called the ambiguous value of the function. 

Example 1 . When we say “x + y is symmetric” we mean by “x + y” 
the fhnction. When we say ‘‘the sum x + y of any two natural numbers 
x and y is ^ ;c”, we mean by “x + y” not a function but a number (the 
ambiguous value of the function). 

This situation can be avoided by using “f” instead of “f[x)” for the 
function, so long as we are talking only about functions for each of 
which a symbol such as “g”, “ + ” or “ 9 ” has been introduced. But 
notations which show the independent variables are very convenient for 
naming other functions composed out of those functions (and constants), 
e.g.'U(g(x)y’,“x* + 3 x” or“<?( 2 , *)”. 

Example 2. To consider this in more detail, say that / and g are 
given number-theoretic functions of one variable each, i.e. functions from 
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the set of the natural numbers to the same set. Let x be any natural 
number. Then g(x) is a natural number, i.e. the value of g for x as argument, 
and f(g{x)) is a natural number, i.e. the value of / for the natural number 
g(x) as argument. So to any natural number x, another number f{g(x)) 
is determined. Thus “f(g(x))” stands for the ambiguous value of a new 
function (Meaning 2 ); and it is also convenient to use it as a name for 
the new function itself (Meaning 1). 


There is another notation (due to Church 1932) in which the inde¬ 
pendent variables appear, but which represents the function / as distinct 
from its ambiguous value, namely: “X*/(*)”, or for a function of n variables, 

“X*1 ...*«/(*!.e.g. "X*/(g(*))’\ “X*** + 3 *”, “X* ? ( 2 ,*)”.We 

shall use this X-notation for emphasis in situations where especial care is 
required. 


Example 3. Let 9 be a function of two numbers. Using the X-notation 
consistently, i.e. whenever we mean the function instead of the ambiguous 
value, we can distinguish (a) the number y{x, y), (b) the function X* 9 (x, y ) 
of one variable x, with y as parameter, (c) the function 'kxyy(x, y) of two 
variables, with x as first and y as second variable, (d) the function 
~kyx 9 (x, y ), with y as first and x as second variable, (e) the function XxXy 9 (x, y) 
of one variable x, whose values are functions of another variable y, etc. 
(Schonfinkel 1924 and Church identify (c) and (e), but that is not necessary 
for us.) 

For any n-tuple t v ..., t n of arguments for /, 

{X*i • • • X n f{X\i • • • / • • • > tn) == j(t 1 , ■ • •» 

For example, {kx x 2 + 3 x}( 2 ) = 10, {Kx(p(x, y)}(0) = 9(0, y), 

{Xy* 9(x, y)}( 0 , 3) = 9 ( 3 , 0 ), {key y{x, y)}{z, x) = 9 (z, x). 

We have described a function as a many-one correspondence. One 
may go further in saying what a many-one correspondence is to be, 
according to the kind of theory one is working in. In set-theoretic terms, 
the correspondence can be identified with the set of all the ordered pairs 
(x, y) of corresponding elements of X and Y v One may speak instead of 
the law or rule establishing the correspondence, at least in dealing with 
such functions that a law or rule in some understood sense can be given 
for each function. In the case that X is a finite set, a function can be given 
as a table. 


Example 4. Let X and Y both be the residues modulo 2, i.e. 
X — Y = { 0 , 1}. The functions x' and x • y can be defined by the follow- 
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ing tables: 




y 

X • 

0 

y 

i 

x 0 

1 

x 0 

0 

0 

1 

0 

i 

0 

1 


Thus by the second table, 0 • 0 = 0 • 1 = 1-0 = 0 and 1-1 = 1 . 




Chapter III 


A CRITIQUE OF MATHEMATICAL REASONING 

§ 11. The paradoxes. This chapter is intended to present the problem 
situation out of which the investigations to be reported in the rest of the 
book arose, i.e. the situation preceding those investigations (but not 
how it has since changed). 

In the arithmetization of analysis (§ 9 ), an infinite collection (of ra¬ 
tional forming the lower half of a Dedekind cut, or of digits in sequence 
forming a non-terminating decimal, etc.) is constituted an object, and the 
set of all such objects is considered as a new collection. From this it is a 
natural step to Cantor’s general set theory. 

Hardly had these theories been consolidated, when the validity of 
the whole construction was cast into doubt by the discovery of paradoxes 
or antinomies in the fringes of the theory of sets. 

(A) The Burali-Forti paradox 1897*, also known to Cantor in 1895 , 
arises in Cantor’s theory of transfinite ordinals. 

(B) Somewhat similar antinomies occur in the theory of transfinite 
cardinals, particularly Cantor’s paradox (found by him in 1899 ). Consider 
the set of all sets; call it M. By Cantor’s theorem (Theorem C § 5 ), UM > M. 
Also, since M is the set of all sets, and UM is a set of sets (namely, the set 
of the subsets of M), UM C M. Hence by Corollary A Theorem A, 
UM < M; and so by § 3 , not UM > M- Thus we have proved both that 
UM > M and that not UM > M- 

Starting with the same M, we can also reach a paradox thus. To each 
member M of M, i.e. to any set M, by Theorem C there is another member 
M' of M, namely UM, such that M < M' . Hence by Theorem D, M < ©M 
for every member M of M. But M is the set of all sets, so ©M is one of its 
members. Taking the M in the inequality just proved to be this member, 
we have ©M < ©M. But by § 3, for any set M, not M < M‘, hence in 
particular, not ©M < ©M. 

The paradox with ©M results likewise, if we start out with the set of 
all cardinal numbers, and choose as M a set containing, to each cardinal 
number, a set M having that cardinal. 


36 
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If the notion of sets of arbitrary elements used here is thought to be 
too vague and hence unmathematical, we can prescribe as admissible 
elements of sets (a 2 ) the natural numbers 0, 1,2, ... (or (a 2 ) the empty set 
O) and (b) arbitrary sets whose members are admissible elements. With 
this prescription, the above paradoxes and the next arise as before (with 
(aj), Gentzen 1936). 

(C) The Russell paradox 1902-3*, discovered independently by Zermelo, 
deals with the set of all sets which are not members of themselves. Call 
this set T. Is T a member of itself ? 

Let us assume, for the sake of the argument, that T is a member of 
itself, i.e. in symbols T 6 T. The assumption says that T is a member of 
T, i.e. T is a member of the set of all sets which are not members of 
themselves, i.e. T is a set which is not a member of itself, i.e. in symbols 
T t T. This contradicts the assumption TET. Thus far we have no paradox, 
as the contradiction between T 6 T and T IT has arisen only under the 
assumption T QT. By reductio ad absurdum, we conclude that the 
assumption is false. Thus we have now proved outright, without as¬ 
sumption, that TIT. 

From the established result TIT, we can argue further. The result says 
that T is not a member of the set of all sets which are not members of 
themselves, i.e. T is not a set which is not a member of itself, i.e. T is 
a set which is a member of itself, i.e. in symbols T ET. Now T E T and 
TET are both established, so we have a paradox. 

This paradox can be extracted from Cantor’s thus. If we prescribe 
(a 2 ) and (b) as admissible elements, so that sets have only sets as members, 
then when M is the set of all sets, UM = M, and the set T of the paradox 
is obtained by applying the proof of Lemma A § 5 to the identical 1-1 
correspondence M ~ UM in which each element of M corresponds to itself 
in UM. 

A popularization cf the paradox (Russell 1919) concerns the barber in 
a certain village, who shaves all and only those persons in the village who 
do not shave themselves. Does he shave himself ? (Of course here we can 
escape the paradox simply by concluding that there never was such a 
barber.) 

Every municipality in Holland must have a mayor, and no two may 
have the same mayor. Sometimes it happens that the mayor is a non¬ 
resident of the municipality. Suppose a law is passed setting aside a 
special area S exclusively for such non-resident mayors, and compelling 
all non-resident mayors to reside there. 

Suppose further that there are so many non-resident mayors that S 
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has to be constituted a municipality. Where shall the mayor of 5 reside ? 
(Mannoury, cf. van Dantzig 1948.) 

Suppose the Librarian of Congress compiles, for inclusion in the 
Library of Congress, a bibliography of all those bibliographies in the Li¬ 
brary of Congress which do not list themselves. (Gonseth 1933.) 

Russell also showed how to recast his paradox in logical instead of set- 
theoretic terminology. A property is called ‘predicable’ if it applies to 
itself, ‘impredicable’ if it does not apply to itself. For example, the prop¬ 
erty ‘abstract’ is abstract, and hence predicable; but ‘concrete’ is also 
abstract and not concrete, and hence is impredicable. What about the 
property ‘impredicable’ ? 

(D) The Richard paradox 1905, also substantially given by Dixon 1906, , 
deals with the notion of finite definability. For definiteness, let this refer 
to a given language, say the English language with a preassigned alphabet, 
dictionary and grammar. The alphabet we may take as consisting of 
the blank space (to separate words), the 26 Latin letters, and the comma. 
By an ‘expression’ in the language we may understand simply any finite 
sequence of these 28 symbols not beginning with a blank space. The 
expressions in the English language can then be enumerated by the device 
which we applied at the end of § 1 to the enumeration of the algebraic 
equations. 

An expression may define a number-theoretic function of one variable 
(i.e. a function of a natural number taking a natural number as value). 
From the specified enumeration of all the expressions in the English 
language, by striking out those which do not define a number-theoretic 
function, we obtain an enumeration (say E 0 , E lt E 2 , ...) of those which 
do (say the functions defined are respectively f 0 (n), hin), f 2 (n), ...). 

Now consider the following expression, ‘‘the function whose value, for 
any given natural number as argument, is equal to one more than the 
value, for the given natural number as argument, of the function defined 
by the expression which corresponds to the given natural number in the 
last described enumeration”. 

In the quoted expression we refer to the above described enumeration 
of the expressions in the English language defining a number-theoretic 
function, without defining it. But we could easily have written in the 
definition of that enumeration in full, as part of the quoted expression. 
We should then have before us a definition of a function (briefly, the 
function /„(«) + 1 ), by an expression in the English language. This 
function, by its definition, must differ from every function definable by 
an expression in the English language. 
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This paradox is especially interesting for its implications concerning 
languages such as English, and because it runs so close to Cantor’s proof 
of the non-enumerability of the number-theoretic functions (§ 2). Richard 
gave the paradox in a form relating to the definition of a real number, 
paralleling Cantor’s proof of the non-enumerability of the real numbers. 

Consider the expression, “the least natural number not nameable 
in fewer than twenty-two syllables’’. This expression names in twenty- 
one syllables a natural number which by definition cannot be named in 
fewer than twenty-two syllables! (Berry 1906.) 

(E) These modern paradoxes, which fall more or less within the context 
of set theory, are related to a very ancient one. 

The statement “Cretans are always liars ...” is attributed to the 
philosopher Epimenides of Crete (sixth century B.C.). (The statement was 
quoted by Paul in “Epistle to Titus”, I, 12, as by a Cretan “prophet”, 
whom early Christian tradition, according to more recent sources, identi¬ 
fied with Epimenides. Cf. Weyl 1949 p. 228 .) 

Suppose we distinguish two kinds of liars: liars of the first kind, who 
tell the truth some of the time, and liars of the second kind, who tell only 
lies. Let us interpret Epimenides’ statement to mean that all Cretans are 
liars of the second kind. Suppose his statement were true. By what it 
says and the fact he is a Cretan, it must then be false. This is a con¬ 
tradiction; hence by reductio ad absurdum, the statement must be false. 
The falsity of the statement requires that there has been, or will eventually 
be, a Cretan who at some time tells the truth. Had the quoted statement 
been the only one any Cretan makes, we should have a paradox. It is 
logically unsatisfactory that we should escape paradox only through the his¬ 
torical accident that some Cretan existed who sometime told the truth. 

The Epimenides paradox, known also as the liar, appears in stark form, 
if a person says simply, “This statement I am now making is a lie.” The 
quoted statement can neither be true nor false without entailing a con¬ 
tradiction. This version of the paradox is attributed to Eubulides (fourth 
century B.C.), and was well known in ancient times. (Cf. Riistow 1910. 
If the statement “Cretans are always liars ...” is not authentically 
Epimenides’, or was not originally recognized as paradoxical, the Eubuli¬ 
des version of the Liar may then be older than the “lying Cretan” version.) 

In the ancient "dilemma of the crocodile”, a crocodile has stolen a 
child. The crocodile promises the child’s father to return the child, provid¬ 
ed that the father guesses whether the crocodile will return the child 
or not. What should the crocodile do, if the father guesses that the 
crocodile will not return the child? (Cf. Prantl 1855 p. 493 .) 
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The following riddle also turns upon the paradox. A traveller has fallen 
among cannibals. They offer him the opportunity to make a statement, 
attaching the conditions that if his statement be true, he will be boiled, 
and if it be false, he will be roasted. What statement should he make? 
(A form of this riddle occurs in Cervantes' "Don Quixote” ( 1605 ), II, 51 .) 

§ 12. First inferences from the paradoxes. The reader may try 
his hand at solving the paradoxes. In the half century since the problem has 
been open, no solution has been found which is universally agreed upon. 

The simplest kind of solution would be to locate a specific fallacy, like 
a mistake in a student’s algebra exercise or geometry proof, with nothing 
else needing to be changed. 

Ideas for solving the paradoxes in this sense come to mind on first 
considering them. One may propose that the error in the paradoxes 
(A) — (C) consists in using too large sets, such as the set of all sets or 
the set of all cardinal numbers; or in permitting sets to be considered as 
members of themselves, which again argues against the set of all sets. 
These suggestions are not necessarily wrong, but they are not after all 
simple. They leave us the problem of refounding set theory on a drastically 
altered basis, the details of which are not fully implicit in the suggestions. 
For example, if we ban the set of all cardinal numbers, we are unable 
to introduce the set of the natural numbers, unless we already know that 
they are not all the cardinal numbers; and the same difficulty will arise 
at higher stages. If we ban the set of all sets, we find ourselves in conflict 
with Cantor’s definition of set. In order to have set theory at all, we must 
have theorems about all sets, and all sets then constitute a set under 
Cantor’s definition. If not so, we must say what other definition of set we 
shall use instead, or we must supplement Cantor’s definition with some 
further criterion to determine when a collection of objects as described 
in his definition shall constitute a set (Skolem 1929-30). 

Axiomatic set theory. Reconstructions of set theory can be given, 
placing around the notion of set as few restrictions to exclude too large 
sets as appear to be required to forestall the known antinomies. Since 
the free use of our conceptions in constructing sets under Cantor’s def¬ 
inition led to disaster, the notions of set theory are governed by axioms, 
like those governing ‘point’ and ‘line’ in Euclidean plane geometry. The 
first system of axiomatic set theory was Zermelo’s (1908). Refinements 
in the axiomatic treatment of sets are due to Fraenkel (1922, 1925), 
Skolem (1922-3, 1929), von Neumann (1925, 1928), Bernays (1937-48), 
and others. Analysis can be founded on the basis of axiomatic set theory, 
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which perhaps is the simplest basis set up since the paradoxes for the 
deduction of existing mathematics. Some very interesting discoveries 
have been made in connection with axiomatic set theory, notably by 
Skolem (1922-3; cf. §75 below) and Godel (1938, 1939, 1940). 

The broader problem of foundations. Assuming that the para¬ 
doxes are avoided in the axiomatization of set theory — and of this the 
only assurance we have is the negative one that so far none have been 
encountered — does it constitute a full solution of the problem posed 
by the paradoxes? 

In the case of geometry, mathematicians have recognized since the 
discovery of a non-Euclidean geometry that more than one kind of space 
is possible. Axiom systems serve to single out one or another kind of space, 
or certain common features of several spaces, for the geometer to study. 
A contradiction arising in a formal axiomatic theory can mean simply 
that an unrealizable combination of features has been postulated. 

But in the case of arithmetic and analysis, theories culminating in 
set theory, mathematicians prior to the current epoch of criticism general¬ 
ly supposed that they were dealing with systems of objects, set up 
genetically, by definitions purporting to establish their structure com¬ 
pletely. The theorems were thought of as expressing truths about these 
systems, rather than as propositions applying hypothetically to whatever 
systems of objects (if any) satisfy the axioms. But then how could con¬ 
tradictions have arisen in these subjects, unless there is some defect in 
the logic, some error in the methods of constructing and reasoning about 
mathematical objects, which we had hitherto trusted? 

To say that now these subjects should instead be established on an 
axiomatic basis does not of itself dispose of the problem. After axiomati¬ 
zation, there must still be some level at which we have truth and falsity. 
If the axiomatics is informal, the axioms must be true. If the axiomatics 
is formal, at least we must believe that the theorems do follow from the 
axioms; and also there must be some relationship between these results 
and some actuality outside the axiomatic theory, if the mathematicians’ 
activity is not to reduce to nonsense. The formally axiomatized propo¬ 
sitions of mathematics cannot constitute the whole of mathematics; 
there must also be an intuitively understood mathematics. If we must 
give up our former belief that it comprises all of arithmetic, analysis and 
set theory, we shall not be wholly satisfied unless we learn wherein that 
belief was mistaken, and where now instead to draw a line of separation. 

The immediate problem of eliminating the paradoxes thus merges 
with the broader problem of the foundations of mathematics and logic. 
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What is the nature of mathematical truth ? What meaning do mathemat¬ 
ical propositions have, and on what evidence do they rest? This broad 
problem, or complex of problems, exists for philosophy apart from the 
circumstance that paradoxes have arisen in the fringes of mathematics. 
Historically, this circumstance has led to a more intensive study of the 
problem on the part of mathematicians than would otherwise have 
been likely; and the paradoxes obviously impose conditions on the 
solution of the problem. 

Impredicative definition. When a set M and a particular object m 
are so defined that on the one hand m is a member of M, and on the 
other hand the definition of w depends on M, we say that the procedure 
(or the definition of m, or the definition of M) is impredicative. Similarly, 
when a property P is possessed by an object m whose definition depends 
on P (here M is the set of the objects which possess the property P). 
An impredicative definition is circular, at least on its face, as what is 
defined participates in its own definition. 

Each of the antinomies of § 1 1 involves an impredicative definition. 
In (B), the set M of all sets includes as members the sets UM and ©M 
defined from M. The impredicative procedure in the Russell paradox (C) 
stands out when the definition of T is elaborated thus. We divide the set 
M of all sets into two parts, the first comprising those members which 
contain themselves, and the second (which is T) those which do not. 
Then we put T (defined by this division of M into two parts) back into M, 
to ask into which part of M it falls. In the Richard paradox (D), the 
totality of expressions in the English language which constitute def¬ 
initions of a function (real number, natural number) is taken as including 
the quoted expression, which refers to that totality. In the Epimenides 
paradox (E), the totality of statements is divided into two parts, the true 
and the false statements. A statement which refers to this division is 
reckoned as of the original totality, when we ask whether it is true or 
false. 

Poincare (1905-6,1908) judged the cause of the paradoxes to lie in these 
impredicative definitions; and Russell (1906, 1910) enunciated the same 
explanation in his vicious circle principle: No totality can contain 
members definable only in terms of this totality, or members involving or 
presupposing this totality. Thus it might appear that we have a sufficient 
solution and adequate insight into the paradoxes, except for one cir¬ 
cumstance: parts of mathematics we want to retain, particularly analysis, 
also contain impredicative definitions. 

An example is the definition of u = l.u.b. M (§ 9 (A)). Under the 
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Dedekind cut definition of the real numbers, the set C of real numbers 
is the set of all sets x of rationals having three properties (a), (b), (c). 
Now this totality has been divided into two parts, M and C — M. We 
define u as 3 M, and then reckon this set @M as a member of C. This 
definition u = @M depends on C in the general case, since in the general 
case M will have been defined from C as the set of those members of C 
which have a certain property P. 

One can attempt to defend this impredicative definition by inter¬ 
preting it, not as defining or creating the real number u for the first time 
(in which interpretation the definition of the totality C of real numbers 
is circular), but as only a description which singles out the particular 
number u from an already existing totality C of real numbers. But the 
same argument can be used to uphold the impredicative definitions in the 
paradoxes. 

Weyl’s constructive continuum. The impredicative character of 
some of the definitions in analysis has been especially emphasized by 
Weyl, who in his book "Das Kontinuum (The continuum)” (1918) 
undertook to find out how much of analysis could be reconstructed 
without impredicative definitions. A fund of operations can be provided 
for constructing many particular categories of irrationals. Weyl was thus 
able to obtain a fair part of analysis, but not the theorem that an ar¬ 
bitrary non-empty set M of real numbers having an upper bound has a 
least upper bound. (Cf. also Weyl 1919.) 

There have arisen three main schools of thought on the foundations of 
mathematics: (i) the logicistic school (Russell and Whitehead, English), 
(ii) the intuitionistic school (Brouwer, Dutch), and (iii) the formalistic or 
axiomatic school (Hilbert, German). (Sometimes "logistic” is used instead 
of “logicistic”; but "logistic” also has another meaning § 15 .) This broad 
classification does not include various other points of view, which have 
not been as widely cultivated or do not comprise to a similar degree both 
a reconstruction of mathematics and a philosophy to support it. 

Logicism. The logicistic thesis is that mathematics is a branch of 
logic. The mathematical notions are to be defined in terms of the logical 
notions. The theorems of mathematics are to be proved as theorems of logic. 

Leibniz ( 1666 ) first conceived of logic as a science containing the ideas 
and principles underlying all other sciences. Dedekind (1888) and Frege 
(1884, 1893, 1903) were engaged in defining mathematical notions in 
terms of logical ones, and Peano (1889, 1894-1908) in expressing mathe¬ 
matical theorems in a logical symbolism. 
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To illustrate how mathematical notions can be defined from logical 
ones, let us presuppose the Frege-Russell definition of cardinal number 
(§ 3 ), and the definitions of the cardinal number 0 and of the cardinal 
number n 1 for any cardinal number n (§ 4 ). Then a finite cardinal 
(or natural number) can be defined as a cardinal number which possesses 
every property P such that ( 1)0 has the property P and ( 2 ) n -f- 1 has 
the property P whenever n has the property P. In brief, a natural number 
is defined as a cardinal number for which mathematical induction holds. 
The viewpoint here is very different from that of §§ 6 and 7 , where we 
presupposed an intuitive conception of the natural number sequence, 
and elicited from it the principle that, whenever a particular property 
P of natural numbers is given such that (1) and (2), then any given natural 
number must have the property P. Here instead we presuppose the totality 
of all properties of cardinal numbers as existing in logic, prior to the def¬ 
inition of the natural number sequence. Note that this definition is im- 
predicative, because the property of being a natural number, which it 
defines, belongs to the totality of properties of cardinal numbers, which 
is presupposed in the definition. 

To adapt the logicistic construction of mathematics to the situation 
arising from the discovery of the paradoxes, Russell excluded impred- 
icative definitions by his ramified theory of types (1908, 1910). Roughly, 
this is as follows. The primary objects or individuals (i.e. the given things 
not being subjected to logical analysis) are assigned to one type (say 
type 0), properties of individuals to type 1 , properties of properties of 
individuals to type 2, etc.; and no properties are admitted which do not 
fall into one of these logical types (e.g. this puts the properties ‘pred¬ 
icable' and ‘impredicable’ of § 11 outside the pale of logic). A more 
detailed account would describe the admitted types for other objects, 
such as relations and classes. Then, to exclude impredicative definitions 
within a type, the types above type 0 are further separated into orders. 
Thus for type 1, properties defined without mentioning any totality 
belong to order 0, and properties defined using the totality of properties 
of a given order belong to the next higher order. (The logicistic definition 
of natural number now becomes predicative, when the P in it is specified 
to range only over properties of a given order, in which case the property 
of being a natural number is of the next higher order.) But this separation 
into orders makes it impossible to construct the familiar analysis, which 
as we saw above contains impredicative definitions. To escape this 
outcome, Russell postulated his axiom of reducibility, which asserts that to 
any property belonging to an order above the lowest, there is a coextensive 
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property (i.e. one possessed by exactly the same objects) of order 0. If only 
definable properties are considered to exist, then the axiom means that 
to every impredicative definition within a given type there is an equiva¬ 
lent predicative one. 

The deduction of mathematics as a province of logic was carried out on 
this basis, using a logical symbolism, in the monumental''Principia mathe- 
matica” of Whitehead and Russell (three volumes, 1910-13). This work 
has had a great influence on subsequent developments in symbolic logic. 

This deduction of mathematics from logic was offered as intuitive 
axiomatics. The axioms were intended to be believed, or at least to be 
accepted as plausible hypotheses concerning the world. 

The difficulty is now: on what grounds shall we believe in the axiom 
of reducibility ? If properties are to be constructed, the matter should be 
settled on the basis of constructions, not by an axiom. As the authors 
admitted in the introduction to their second edition ( 1925 ), "This axiom 
has a purely pragmatic justification: it leads to the desired results, and 
to no others [so far as is known]. But clearly it is not the sort of axiom 
with which we can rest content.” 

Ramsey 1926 found that the desired results and no others can ap¬ 
parently be obtained without the hierarchy of orders (i.e. with a simple 
theory of types). He classified the known antinomies into two sorts, now 
called ‘logical’ (e.g. the Burali-Forti, Cantor and Russell) and ‘epistemo¬ 
logical’ or ‘semantical’ (e.g. the Richard and Epimenides); and he observed 
that the logical antinomies are (apparently) stopped by the simple 
hierarchy of types, and the semantical ones are (apparently) prevented 
from arising within the symbolic language by the absence therein of the 
requisite means for referring to expressions of the same language. But 
Ramsey’s arguments to justify impredicative definitions within a type 
entail a conception of the totality of predicates of the type as existing 
independently of their constructibility or definability. This has been 
called ‘‘theological”. Thus neither Whitehead and Russell nor Ramsey 
succeeded in attaining the logicistic goal constructively. (An interesting 
proposal for justifying impredicative definitions within a type, by Lang¬ 
ford 1927 and Carnap 1931-2, is also not free of difficulties.) 

Weyl 1946 says that, in the system of "Principia mathematica”, 
"mathematics is no longer founded on logic, but on a sort of logician’s 
paradise ..and he observes that one who is ready to believe in this 
"transcendental world” could also accept the system of axiomatic set 
theory (Zermelo, Fraenkel, etc.), which, for the deduction of mathematics, 
has the advantage of being simpler in structure. 
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Logicism treats the existence of the natural number series as an 
hypothesis about the actual world (‘axiom of infinity’). A quite different 
handling of the problem of infinity is proposed by the intuitionists (§ 13 ) 
and the formalists (§ 14 ). 

From both the intuitionistic and the formalistic standpoints, the 
(abstract) natural number sequence is more elementary than the notions 
of cardinal number and of all properties of cardinal numbers, which are 
used in the logicistic characterization of it. 

The logicistic thesis can be questioned finally on the ground that logic 
already presupposes mathematical ideas in its formulation. In the in¬ 
tuitionistic view, an essential mathematical kernel is contained in the 
idea of iteration, which must be used e.g. in describing the hierarchy of 
types or the notion of a deduction from given premises. 

Recent work in the logicistic school is that of Quine 1940*. A critical 
but sympathetic discussion of the logicistic order of ideas is given by Godel 
1944. Introductory treatments are provided by Russell 1919 and Black 1933. 

§ 13. Intuitionism. In the 1880 ’s, when the methods of Weierstrass, 
Dedekind and Cantor were flourishing, Kronecker argued vigorously 
that their fundamental definitions were only words, since they do not 
enable one in general to decide whether a given object satisfies the 
definition. 

Poincare, when he defends mathematical induction as an irreducible 
tool of intuitive mathematical reasoning (1902,1905-6), is also a fore¬ 
runner of the modern intuitionistic school. 

In 1908 Brouwer, in a paper entitled “The untrustworthiness of the 
principles of logic”, challenged the belief that the rules of the classical 
logic, which have come down to us essentially from Aristotle ( 384—322 
B.C.), have an absolute validity, independent of the subject matter to 
which they are applied. Quoting from Weyl 1946, “According to his 
view and reading of history, classical logic was abstracted from the 
mathematics of finite sets and their subsets. ... Forgetful of this limited 
origin, one afterwards mistook that logic for something above and prior 
to all mathematics, and finally applied it, without justification, to the 
mathematics of infinite sets.” 

Two obvious examples will illustrate that principles valid in thinking 
about finite sets do not necessarily carry over to infinite sets. One is the 
principle that the whole is greater than any proper part, when applied 
to 1-1 correspondences between sets (§§ 1, 3 , 4 ). Another is that a set of 
natural numbers contains a greatest. 
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A principle of classical logic, valid in reasoning about finite sets, which 
Brouwer does not accept for infinite sets, is the law of the excluded middle. 
The law, in its general form, says for every proposition A, either A or not A. 
Now let A be the proposition there exists a member of the set (or domain) D 
having the property P. Then not A is equivalent to every member of D does 
not have the property P, or in other words every member of D has the property 
not-P. The law, applied to this A , hence gives either there exists a member 
of D having the property P, or every member of D has the property not-P. 

For definiteness, let us specify P to be a property such that, for any 
given member of D, we can determine whether that member has the 
property P or does not. 

Now suppose D is a finite set. Then we could examine every member of 
D in turn, and thus either find a member having the property P, or verify 
that all members have the property not-P. There might be practical dif¬ 
ficulties, e.g. when D is a very large set having say a million members, 
or even for a small D when the determination whether or not a given 
member has the property P may be tedious. But the possibility of com¬ 
pleting the search exists in principle. It is this possibility which for Brouw¬ 
er makes the law of the excluded middle a valid principle for reasoning 
with finite sets D and properties P of the kind specified. 

For an infinite set D, the situation is fundamentally different. It is no 
longer possible in principle to search through the entire set D. 

Moreover in this situation the law is not saved for Brouwer by sub¬ 
stituting, for the impossible search through all the members of the 
infinite set D, a mathematical solution of the problem posed. We may in 
some cases, i.e. for some sets D and properties P, succeed in finding a 
member of D having the property P ; and in other cases, succeed in show¬ 
ing by mathematical reasoning that every member of D has the property 
not-P, e.g. by deducing a contradiction from the assumption that an 
arbitrary (i.e. unspecified) member of D has the property P. (An example 
for the second kind of solution is when D is the set of all the ordered pairs 
(m, n) of positive integers, and P is the property of a pair (m, n) that 
m 2 = 2 n 2 . The result is then Pythagoras’ discovery that \/2 is irrational.) 
But we have no ground for affirming the possibility of obtaining either 
one or the other of these kinds of solutions in every case. 

An example from modern mathematical history is afforded by Fermat’s 
“last theorem”, which asserts that the equation x n + y n = z n has no 
solution in positive integers x, y, z, n with n > 2 . (For n = 2 , there are 
triples of positive integers, called Pythagorean numbers, which satisfy, 
e.g. x — 3 , y == 4 , z = 5 or x = 5 , y = 12, z — 13 .) Here D is the set of 
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all ordered quadruples (x, y, z, n) of positive integers with n > 2, and 
P is the property of a quadruple ( x , y, z, n) that x n + y n — z n . About 1637 
Fermat wrote on the margin of his copy of Bachet’s “Diophantus” that 
he had discovered a truly marvellous demonstration of this “theorem” 
which the margin was too narrow to contain. Despite an immense ex¬ 
penditure of effort, no one since then has succeeded in proving or dis¬ 
proving the alleged “theorem”; and moreover we lack the knowledge of 
any systematic method, the pursuit of which must in principle ultimately 
lead to a determination as to its truth or falsity. (Cf. Vandiver 1946 for 
details.) 

Brouwer’s non-acceptance of the law of the excluded middle for in¬ 
finite sets D does not rest on the failure of mathematicians thus far to 
have solved this particular problem, or any other particular problem. To 
meet his objection, one would have to provide a method adequate in 
principle for solving not only all the outstanding unsolved mathematical 
problems, but any others that might ever be proposed in the future. How 
likely it is that such a method will be found, we leave for the time being 
to the reader to speculate. Later in the book we shall return to the 
question (§ 60 ). 

The familiar mathematics, with its methods and logic, as developed 
prior to Brouwer’s critique or disregarding it, we call classical ; the 
mathematics, methods or logic which Brouwer and his school allow, we 
call intuitionistic. The classical includes parts which are intuitionistic and 
parts which are non-intuitionistic. 

The non-intuitionistic mathematics which culminated in the theories 
of Weierstrass, Dedekind and Cantor, and the intuitionistic mathematics 
of Brouwer, differ essentially in their view of the infinite. In the former, 
the infinite is treated as actual or completed or extended or existential. An 
infinite set is regarded as existing as a completed totality, prior to or in¬ 
dependently of any human process of generation or construction, and as 
though it could be spread out completely for our inspection. In the latter, 
the infinite is treated only as potential or becoming or constructive. The 
recognition of this distinction, in the case of infinite magnitudes, goes 
back to Gauss, who in 1831 wrote, “I protest ... against the use of an 
infinite magnitude as something completed, which is never permissible 
in mathematics.” (Werke VIII p. 216 .) 

According to Weyl 1946, “Brouwer made it clear, as I think beyond 
any doubt, that there is no evidence supporting the belief in the existential 
character of the totality of all natural numbers .... The sequence of 
numbers which grows beyond any stage already reached by passing to the 
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next number, is a manifold of possibilities open towards infinity; it 
remains forever in the status of creation, but is not a closed realm of 
things existing in themselves. That we blindly converted one into the 
other is the true source of our difficulties, including the antinomies — a 
source of more fundamental nature than Russell’s vicious circle principle 
indicated. Brouwer opened our eyes and made us see how far classical 
mathematics, nourished by a belief in the ‘absolute’ that transcends all 
human possibilities of realization, goes beyond such statements as can 
claim real meaning and truth founded on evidence.” 

Brouwer’s criticism of the classical logic as applied to an infinite set D 
(say the set of the natural numbers) arises from this standpoint respecting 
infinity. We see this clearly by considering the meanings which the in- 
tuitionist attaches to various forms of statements. 

A generality statement all natural numbers n have the property P, or 
briefly for all n, P(n), is understood by the intuitionist as an hypothetical 
assertion to the effect that, if any particular natural number n were given 
to us, we could be sure that that number n has the property P. This is a 
meaning which does not require us to take into view the classical com¬ 
pleted infinity of the natural numbers. 

Mathematical induction is an example of an intuitionistic method for 
proving generality propositions about the natural numbers. A proof by 
induction of the proposition for all n, P(n ) shows that any given n would 
have to have the property P, by reasoning which uses only the numbers 
from 0 up to n (§ 7 ). Of course, for a particular proof by induction to be 
intuitionistic, also the reasonings used within its basis and induction step 
must be intuitionistic. 

An existence statement there exists a natural number n having the property 
P, or briefly there exists an n such that P{n), has its intuitionistic meaning 
as a partial communication (or abstract) of a statement giving a par¬ 
ticular example of a natural number n which has the property P, or at 
least giving a method by which in principle one could find such an 
example. 

Therefore an intuitionistic proof of the proposition there exists an n 
such that P(n) must be constructive in the following (strict) sense. The 
proof actually exhibits ah example of an n such that P(»), or at least in¬ 
dicates a method by which one could in principle find such an example. 

In classical mathematics there occur non-constructive or indirect 
existence proofs, which the intuitionists do not accept. For example, to 
prove there exists an n such that P{n), the classical mathematician may 
deduce a contradiction from the assumption for all n, not P(n). Under 
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both the classical and the intuitionistic logic, by reductio ad absurdum 
this gives not for all n, not P(n). The classical logic allows this result to 
be transformed into there exists an n such that P(n), but not (in general) 
the intuitionistic. Such a classical existence proof leaves us no nearer 
than before the proof was given to having an example of a number n such 
that P(n) (though sometimes we may afterwards be able to discover one 
by another method). The intuitionist refrains from accepting such an 
existence proof, because its conclusion there exists an n such that P(n) 
can have no meaning for him other than as a reference to an example of 
a number n such that P(n), and this example has not been produced. The 
classical meaning, that somewhere in the completed infinite totality of 
the natural numbers there occurs an n such that P(n), is not available to 
him, since he does not conceive the natural numbeis as a completed 
totality. 

As another example of a non-constructive existence proof, suppose it 
has been shown for a certain P, by intuitionistic methods, that if Fermat’s 
“last theorem” is true, then the number 5013 has the property P, and also 
that if Fermat’s “last theorem” is false, then 10 has the property P. 
Classically this suffices to demonstrate the existence of a number n such 
that P(m). But with the problem of the “last theorem” unsolved, Brouwer 
would disallow such an existence proof, because no example has been 
given. We do not know that 5013 is an example, nor do we know that 
10 is an example, nor do we know any procedure which would in principle 
(i.e. apart from practical limitations on the length of procedures we can 
carry out) lead us to a particular number which we could be sure is an 
example. Brouwer would merely accept what has been given as proving 
the implication (or conditional statement) if F or not F, then there exists an 
n such that P{n), where F is the statement for all x, y, z > 0 and n > 2 , 
x n + y” ^ z n . The classical mathematician, by his law of the excluded 
middle, has the premise F or not F of this implication, and so he can infer 
its conclusion there exists an n such that P(n). But in the present state of 
knowledge, Brouwer does not accept the premise F or not F as known. 

As appears in this example, intuitionistic methods are to be dis¬ 
tinguished from non-intuitionistic ones in the case of definitions as well 
as in the case of proofs. In the present state of our knowledge, Brouwer 
does not accept the number n which is equal to 5013 if F, and equal to 10 
if not F aS a valid definition of a natural number n. 

A disjunction A or B constitutes for the intuitionist an incomplete 
communication of a statement telling us that A holds or that B holds, 
or at least giving a method by which we can choose from A and B one 
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which holds. A conjunction A and B means that both A and B hold. An 
implication A implies B (or if A , then B) expresses that B follows from A 
by intuitionistic reasoning, or more explicitly that one possesses a 
method which, from any proof of A, would procure a proof of B; and a 
negation not A (or A is absurd) that a contradiction B and not B follows 
from A by intuitionistic reasoning, or more explicitly that one possesses 
a method which, from any proof of A, would procure a proof of a contra¬ 
diction B and not B (or of a statement already known to be absurd, such 
as 1 = 0 ). Additional comments on these intuitionistic meanings will be 
given in § 82 . See Note 1 on p. 65 . 

Quoting from Heyting 1934, “According to Brouwer, mathematics 
is identical with the exact part of our thinking. ... no science, in par¬ 
ticular not philosophy or logic, can be a presupposition for mathematics. 
It would be circular to apply any philosophical or logical principles as 
means of proof, since mathematical conceptions are already presupposed 
in the formulation of such principles.” There remains for mathematics 
"no other source than an intuition, which places its concepts and in¬ 
ferences before our eyes as immediately clear.” This intuition “is nothing 
other than the faculty of considering separately particular concepts and 
inferences which occur regularly in ordinary thinking.” The idea of the 
natural number series can be analyzed as resting on the possibility, first 
of considering an object or experience as given to us separately from the 
rest of the world, second of distinguishing one such from another, and 
third of imagining an unlimited repetition of the second process. “In the 
intuitionistic mathematics, one does not draw inferences according to 
fixed norms, which can be collected in a logic, but each single inference is 
immediately tested on its evidence.” But also “There are general 
rules, by which from given mathematical theorems new theorems can be 
formed in an intuitively clear way; the theory of these connections can 
be treated in a ‘mathematical logic’, which is then a branch of mathe¬ 
matics and is not sensibly applied outside of mathematics.” 

We turn now to the question: How large a part do the non-intui- 
tionistic methods play in the classical mathematics? 

The fact that non-intuitionistic methods occur in classical elementary 
number theory is significant, since it enables elementary number theory 
to serve as the first and simplest testing ground in research on founda¬ 
tions growing out of the intuitionistic and formalistic thinking. We 
shall be almost wholly concerned with elementary number theory in 
this book. 

Actually, in the existing body of elementary number theory, the 
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non-intuitionistic methods do not play a large part. Most non-constructive 
existence proofs can be replaced by constructive ones. 

On the other hand, in analysis (and still more transcendental branches 
of mathematics) the non-intuitionistic methods of definition and proof 
permeate the whole methodology. The real numbers in the Dedekind cut 
representation are infinite sets of rationals (§ 9 ). Thus to treat them as 
objects in the usual way, we are already using the completed infinite. 
In particular, we do apply the law of the excluded middle to these sets, 
in connection with the simplest definitions of the subject. For example, 
to show that for any two real numbers x and y, either x < y or x = y 
or x > y, we use it twice, thus: Either there exists a rational r in y 
which does not belong to x, or all rationals in y belong tox; and similarly 
interchanging x and y. In the impredicative definition of l.u.b. M (§ 9 

(A) , § 12 ), we use the totality of the real numbers in the same way. 
Another instance of non-constructive reasoning occurs in the proof of 

(B) § 9 , where we assumed the right to choose an element a n from a set 
M n , simultaneously for infinitely many values of n, without giving any 
property to determine which element is chosen. (This is a case of the 
'axiom of choice’, first noticed as an assumption by Zermelo 1904. We 
used it also for Theorem B § 4 .) 

Although the completed infinite has been banned for magnitudes (as 
Gauss called upon us to do), it reappears in full force for collections. 
As Hilbert and Bemays describe the situation in their “Grundlagen der 
Mathematik (Foundations of mathematics)”, vol. 1 (1934), p. 41 , “The 
... arithmetization of analysis is not without a residue left over, as certain 
systematic fundamental conceptions are introduced which do not belong 
to the domain of intuitive arithmetical thinking. The insight which has 
given us the rigorous foundation of analysis consists in this: that these few 
fundamental assumptions do suffice for building up the theory of magni¬ 
tudes as a theory of sets of integers.” 

The next question is: What kind of a mathematics can be built within 
the intuitionistic restrictions? If the existing classical mathematics could 
be rebuilt within the intuitionistic restrictions, without too great increase 
in the labor required and too great sacrifices in the results achieved, the 
problem of its foundations would appear to be solved. 

The intuitionists have created a whole new mathematics, including 
a theory of*the continuum and a set theory (cf. Heyting 1934). This 
mathematics employs concepts and makes distinctions not found in the 
classical mathematics; and it is very attractive on its own account. As 
a substitute for classical mathematics it has turned out to be less powerful, 
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and in many ways more complicated to develop. For example, in Brouwer’s 
theory of the continuum, we cannot affirm that any two real numbers 
a and b are either equal or unequal. Our knowledge about the equality or 
inequality of a and b can be more or less specific. By a b, it is meant 
that a = b leads to a contradiction, while a # b is a stronger kind of 
inequality which means that one can give an example of a rational number 
which separates a and b. Of course a # b implies a =£ b. But there are 
pairs of real numbers a and b for which it is not known that either a = b 
or a =£ b (or a # b). It is clear that such complications replace the clas¬ 
sical theory of the continuum by something much less perspicuous in 
form. 

Despite this, the possibility of an intuitionistic reconstruction of 
classical mathematics in a different way involving reinterpretation (re¬ 
cently undertaken) is not to be ruled out (cf. § 81 ). 

§ 14. Formalism. Brouwer has revealed what the genetic or con¬ 
structive tendency involves in its ultimate refinement; Hilbert does 
the same for the axiomatic or existential (§ 8). The axiomatic method 
had already been sharpened from the material axiomatics of Euclid to 
the formal axiomatics of Hilbert’s “Grundlagen der Geometrie” (1899). 
Formalism is the result of a further step, to meet the crisis caused by the 
paradoxes and the challenge to classical mathematics by Brouwer and 
Weyl. This step was forecasted by Hilbert in 1904, and seriously under¬ 
taken by him and his collaborators Bernays, Ackermann, von Neumann 
and others since 1920 (cf. Bernays 1935a, Weyl 1944). 

Hilbert conceded that the propositions of classical mathematics which 
involve the completed infinite go beyond intuitive evidence. But he re¬ 
fused to follow Brouwer in giving up classical mathematics on this account. 

To salvage classical mathematics in the face of the intuitionistic criti¬ 
cism, he proposed a program which we can state preliminarily as follows: 
Classical mathematics shall be formulated as a formal axiomatic theory, 
and this theory shall be proved to be consistent, i.e. free from contra¬ 
diction. 

Prior to this proposal of Hilbert's, the method used in consistency proofs 
for axiomatic theories, especially in Hilbert’s earlier axiomatic thinking, 
was to give a ‘model’. A model for an axiomatic theory is simply a system 
of objects, chosen from some other theory and satisfying the axioms 
(§ 8). That is, to each object or primitive notion of the axiomatic theory, 
an object or notion of the other theory is correlated, in such a way that 
the axioms become (or correspond to) theorems of the other theory. 
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If this other theory is consistent, then the axiomatic theory must be. 
For suppose that, in the axiomatic theory, a contradiction were deducible 
from the axioms. Then, in the other theory, by corresponding inferences 
about the objects constituting the model, a contradiction would be de¬ 
ducible from the corresponding theorems. , 

In a famous early example, Beltrami ( 1868 ) showed that the lines in 
the plane non-Euclidean geometry of Lobatchevsky and Bolyai (the plane 
hyperbolic geometry) can be represented by the geodesics on a surface of 
constant negative curvature in Euclidean space. Thus the plane hyper¬ 
bolic geometry is consistent, if the Euclidean geometry is consistent. 
(Another model for the same was given by Klein ( 1871 ) in terms of plane 
projective geometry with Cayley's metric ( 1859 ); this can be construed 
as a model in the Euclidean plane. Cf. Young ign Lectures II and III.) 

The analytic geometry of Descartes ( 1619 ), i.e. the use of coordinates 
to represent geometrical objects, constitutes a general method for 
establishing the consistency of geometric theories on the basis of analysis, 
i.e. the theory of the real numbers. 

Consistency proofs by the method of a model are relative. The theory 
for which a model is set up is consistent, if that from which the model is 
taken is consistent. 

Only when the latter is unimpeachable does the model give us an 
absolute proof of consistency. Veblen and Bussey 1906 achieve absolute 
proofs of consistency for certain rudimentary projective geometries by 
setting up models using only a finite (sic!) class of objects to represent 
the points (cf. Young 1911 Lectures IV and V). 

For proving absolutely the consistency of classical number theory, 
of analysis, and of set theory (suitably axiomatized), the method of a 
model offers no hope. No mathematical source is apparent for a model 
which would not merely take us back to one of the theories previously 
reduced by the method of a model to these. 

The impossibility of drawing upon the perceptual or physical world 
for a model is argued in Hilbert and Bernays 1934 pp. 15 — 17 . They 
illustrate it by considering Zeno’s first paradox (fifth century B.C.), 
according to which a runner cannot run a course in a finite time. For 
before he can do so, he must run the first half, then the next quarter, 
then the next eight, and so on. But this would require him to complete 
an infinite number of acts. The usual solution of the paradox consists 
in observing that the infinite series of the time intervals required to run 
the successive segments converges. “Actually there is also a much more 
radical solution of the paradox. This consists in the consideration that 
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we are by no means obliged to believe that the mathematical space-time 
representation of motion is physically significant for arbitrarily small 
space and time intervals; but rather have every basis to suppose that that 
mathematical model extrapolates the facts of a certain realm of experience, 
namely the motions within the orders of magnitude hitherto accessible to 
our observation, in the sense of a simple concept construction, similarly 
to the way the mechanics of continua completes an extrapolation in which 
a continuous filling of the space with matter is assumed .... The situation 
is similar in all cases where one believes it possible to exhibit directly 
an [actual] infinity as given through experience or perception .... 
Closer examination then shows that an infinity is actually not given 
to us at all, but is first interpolated or extrapolated through an intel¬ 
lectual process.” 

Therefore, if consistency is to be proved for number theory (including 
its non-intuitionistic portions), for analysis, etc., it must be by another 
method. It is Hilbert’s contribution now to have conceived a new direct 
approach, and to have recognized what it involves for the axiomatization. 
This direct method is implicit in the meaning of consistency (at least as 
we now think of it), namely that no logical contradiction (a proposition 
A and its negation not A both being theorems) can arise in the theory 
deduced from the axioms. Thus to prove the consistency of a theory 
directly, one should prove a proposition about the theory itself, i.e. 
specifically about all possible proofs of theorems in the theory. The 
mathematical theory whose consistency it is hoped to prove then be¬ 
comes itself the object of a mathematical study, which Hilbert calls 
‘‘metamathematics” or ‘‘proof theory”. How this is possible, and what 
the methods of the study may be, we shall examine in the next section. 

Meanwhile let us consider further the import of Hilbert’s proposal. 
Hilbert (1926, 1928) draws a distinction between ‘real’ and ‘ideal’ state¬ 
ments in classical mathematics, in essence as follows. The real statements 
are those which are being used as having an intuitive meaning; the ideal 
statements are those which are not being so used. The statements which 
correspond to the treatment of the infinite as actual are ideal. Classical 
mathematics adjoins the ideal statements to the real, in order to retain 
the simple rules of the Aristotelian logic in reasoning about infinite sets. 

The addition of ‘ideal elements’ to a system to complete its structure 
and simplify the theory of the system is a common and fruitful device in 
modern mathematics. For example, in Euclidean plane geometry two 
distinct lines intersect in a unique point, except when the lines are parallel. 
To remove this exception, Poncelet in his projective geometry ( 1822 ) 



56 


A CRITIQUE OF MATHEMATICAL REASONING 


CH. Ill 


introduced a point at infinity on each of the original lines, such that par¬ 
allel lines have the same point at infinity and non-parallel lines have 
different points at infinity. The totality of these points at infinity make 
up a line at infinity. As a line through a finite point of the projective plane 
rotates, its point at infinity traces out the line at infinity. By this device, 
the relationships of incidence between points and lines is simplified. Two 
distinct points determine a unique line (which is ‘on’ both points, i.e. 
through both of which the line passes); and two distinct lines determine 
a unique point (which is on both lines). These two propositions are duals 
of each other. There is a general principle, called the principle of duality 
for plane projective geometry, which says that to each theorem of the 
subject the statement obtained from it by interchanging the words 
“point” and “line” is also a theorem. 

As other examples of the addition of elements to a previously con¬ 
stituted system of elements to serve some theoretical purpose, we may 
take the successive enlargements of the number system, starting say 
with the natural numbers, then adjoining the negative integers, then the 
fractions, then the irrationals, and finally the imaginary numbers. The 
adjunction of the negative integers simplifies the theory of addition by 
making the inverse operation (subtraction) always possible; etc. 

Hilbert’s problem is crudely analogous to the problem which existed 
when imaginary numbers first came into use. As they were then not 
clearly understood, one might have proposed to justify their use to 
doubters by proving that, if imaginaries are used according to prescribed 
rules to derive a result expressed in terms of reals only, then that result 
must be correct. Of course, this kind of justification for imaginaries rel¬ 
ative to reals is not needed now, since their interpretation by points in 
the plane (Wessel 1799 ) and by pairs of reals (Gauss 1831 ) have become 
known. 

This analogy suggests asking whether, if a proof of consistency in 
Hilbert’s sense should succeed for a portion of classical mathematics 
comprising both real and ideal statements, we could then infer that the 
real statements proved therein by an excursion through the ideal are true 
intuitionistically ? The extent to which we could will be discussed later 
(end § 42 , end § 82 ); it will depend on what reasonings are covered by the 
consistency proof, and what class of statements is being taken as real. 
To this extent, success in Hilbert’s program would give to classical mathe¬ 
matics a role as a method of proof for the intuitionists. 

A sharp controversy arose between Brouwer and Hilbert in the early 
years after Hilbert’s program took shape. Brouwer 1923 said, “An incorrect 
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theory which is not stopped by a contradiction is none the less incorrect, 
just as a criminal policy unchecked by a reprimanding court is none the 
less criminal.” Hilbert 1928 retorted, "To take the law of the excluded 
middle away from the mathematician would be like denying the astrono¬ 
mer the telescope or the boxer the use of his fists.” 

According to Brouwer (1928) and Heyting (1931-2, 1934), agreement 
between intuitionism and formalism is possible, provided (as in von 
Neumann 1931-2) the formalist refrains from attributing to the non- 
intuitionistic classical mathematics a material meaning or content, in 
terms of which the consistency proof justifies it. Such a justification, 
says Brouwer, "contains a vicious circle, because this justification depends 
on the (material (inhaltlichen)) correctness of the proposition that from 
the consistency of a statement the correctness of that statement follows, 
i.e. on the (material) correctness of the law of the excluded middle”, 
which is part of the formalistic mathematics that is to be justified. 

The delicate point in the formalistic position is to explain how the non¬ 
in tuitionistic classical mathematics is significant, after having initially 
agreed with the intuitionists that its theorems lack a real meaning in 
terms of which they are true. 

Classical mathematics constructs theories in quite a different sense 
from intuitionistic mathematics. Hilbert 1928 says, "It is by no means 
reasonable to set up in general the requirement that each separate formula 
should be interpretable taken by itself ....” In theoretical physics "only 
certain combinations and consequences of the physical laws can be 
checked experimentally — likewise in my proof theory only the real 
statements are immediately capable of a verification”. 

A theory in classical mathematics can be regarded as a simple and 
elegant systematizing scheme, by which a variety of (presumably) true 
real statements, previously appearing as heterogeneous and unrelated, 
and often previously unknown, are comprised as consequences of the 
ideal theorems in the theory. (Cf. von Neumann 1947, Einstein 1944 

p. 288 .) 

The example of analytic number theory illustrates that theorems of 
analysis (lacking a meaning acceptable to the intuitionist) often entail 
theorems of number theory, which are meaningful intuitionistically, and 
for which either no non-analytic proofs have been discovered or only 
much more complicated ones. 

For a theory to be valuable in this way, the real statements com¬ 
prised must be true. Formerly mathematicians supposed this to be 
guaranteed by the truth of the theorems which we now recognize as 
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ideal; now we hope to guarantee it instead by a consistency proof. 

By easy stages of transition, the theorizing may climb to higher levels, 
from which it is only very indirectly concerned with systematizing the 
real propositions at the original level, but rather with systematizing ideal 
propositions at intermediate levels. In this connection it is of interest 
whether successively higher theoretical constructions actually add to the 
body of real propositions of the original sort which are comprised, as 
well as whether they do actually permit substantial simplifications of the 
proofs of those previously comprised. (Cf. end § 42 .) 

It is debatable how high a theoretical structure is justified for system¬ 
atizing a given sort of real truths, e.g. whether classical analysis is justified 
as a systematization of number-theoretic truths. Historically analytic 
number theory was a by-product, and the actual impetus to the de¬ 
velopment of classical analysis came from the sciences, including 
geometry in its physical application. 

Hilbert and Bernays 1934 emphasize that in the sciences “we have 
to do ... predominately with theories which do not Teproduce the actual 
state of affairs completely, but represent a simplifying idealization of 
the state of affairs and have their meaning therein” (pp. 2— 3 ). Analysis 
serves as a “formation of ideas (Ideenbildung)”, in terms of which those 
theories can be expressed, or to which they can be reduced by the method 
of models. A proof of the consistency of analysis would assure us of the 
consistency of the idealizations effected in those theories (p. 19 ). 

Weyl (1926, 1928, 1931) observes that in theoretical physics it is not 
the separate statements which are confronted with experience, but the 
theoretical system as a whole. What is afforded here is not a true de¬ 
scription of what is given, but theoretical, purely symbolic construction 
of the world. (Also he argues that our theoretical interest is not ex¬ 
clusively or even primarily in the ‘real statements’, e.g. that this pointer 
coincides with that scale division, but rather in the ideal suppositions, 
e.g. the supposition of the electron as a universal electrical quantum.) 
It is a deep philosophical question what the ‘truth’ or objectivity is 
which pertains to this theoretical world construction going far beyond 
the given. This is closely connected with the question, what motivates 
us to take as basis the particular axiom system chosen. For this con¬ 
sistency is a necessary but not sufficient argument. When mathematics 
is taken for itself alone, he would restrict himself with Brouwer to the 
intuitive truths; he does not find a sufficient motive to go further. But 
when mathematics is merged completely with physics in the process of 
theoretical world construction, he sides with Hilbert. 
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A verdict on the formalists’ thinking will depend partly on the fruits 
of the program they propose. This program calls for a subject called 
“metamathematics”, in which they aim in particular to establish the 
consistency of classical mathematics. 

We note in advance that metamathematics will be found to provide 
a rigorous mathematical technique for investigating a great variety of 
foundation problems for mathematics and logic, among which the con¬ 
sistency problem is only one. For example, metamathematical methods 
are applied now in studies of systematizations of mathematics arising 
from the logicistic and intuitionistic schools, as well as from Hilbert’s. 
(Inversely, metamathematics owes much for its inception to the logicistic 
and intuitionistic investigations.) Our aim in the rest of this book is not 
to reach a verdict supporting or rejecting the formalistic viewpoint in 
any preassigned version; but to see what the metamathematical method 
consists in, and to learn some of the things that have been discovered in 
pursuing it. 

§ 15. Formalization of a theory. We are now about to undertake 
a program which makes a mathematical theory itself the object of exact 
mathematical study. In a mathematical theory, we study a system of 
mathematical objects. How can a mathematical theory itself be an object 
for mathematical study? 

The result of the mathematician's activity is embodied in propositions, 
the asserted propositions or theorems of the given mathematical theory. 
We cannot hope to study in exact terms what is in the mathematician’s 
mind, but we can contemplate the system of these propositions. 

The system of these propositions must be made entirely explicit. Not 
all of the propositions can be written down, but rather the disciple and 
student of the theory should be told all the conditions which determine 
what propositions hold in the theory. 

As the first step, the propositions of the theory should be arranged 
deductively, some of them, from which the others are logically deducible, 
being specified as the axioms (or postulates). 

This step will not be finished until all the properties of the undefined or 
technical terms of the theory which matter for the deduction of the 
theorems have been expressed by axioms. Then it should be possible to 
perform the deductions treating the technical terms as words in themselves 
without meaning. For to say that they have meanings necessary to the 
deduction of the theorems, other than what they derive from the axioms 
which govern them, amounts to saying that not all of their properties 
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which matter for the deductions have been expressed by axioms. When 
the meanings of the technical terms are thus left out of account, we have 
arrived at the standpoint of formal axiomatics (§ 8). 

The technical terms still have grammatical attributes, being nouns, 
adjectives, verbs, etc. Also there remain ordinary or logical terms, whose 
meanings are employed in the deductions. Indeed the point at which 
formal axiomatization stops is arbitrary, in so far as no absolute basis 
exists for the distinction between the technical and the ordinary terms. 

At any rate, we are still short of our goal of making explicit all the 
conditions which determine what propositions hold in the theory. For 
we have not specified the logical principles to be used in the deductions. 
These principles are not the same for all theories, as we are now well 
aware (§ 13 ). 

In order to make these explicit, a second step is required, which com¬ 
pletes the step previously carried out for the so-called technical terms in 
respect to the non-grammatical part of their meanings. All the meanings 
of all the words are left out of account, and all the conditions which govern 
their use in the theory are stated explicitly. The logical principles which 
formerly entered implicitly through the meanings of the ordinary terms 
will now be given effect in part perhaps by new axioms, and in some 
part at least by rules permitting the inference of one sentence from 
another or others. Since we have abstracted entirely from the content 
or matter, leaving only the form, we say that the original theory has 
been formalized. In its structure, the theory is no longer a system of 
meaningful propositions, but one of sentences as sequences of words, 
which in turn are sequences of letters. We say by reference to the form 
alone which combinations of words are sentences, which sentences are 
axioms, and which sentences follow as immediate consequences from 
others. 

Is such formalization possible ? To what extent a given theory can be 
formalized we shall learn only after attempting it and studying the 
results (e.g. §§ 29 , 42 , 60 , 72 ). 

That at least a very considerable measure of formalization is possible 
for mathematical theories is a discovery which has been spread over a 
long stretch of man’s intellectual history. 

The discovery of the axiomatic-deductive method in mathematics 
is attributed by ancient Greek tradition to Pythagoras (sixth century 
B.C.), and comes to us from Euclid ( 365 ?— 275 ? B.C.), whose “Elements” 
is said to have had the greatest circulation of any book except the Bible. 
Euclid failed to make explicit all of the postulates required in the de- 
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duction of his theorems. Others have been brought into the light in 
modern times, e.g. those governing the order of the points on a line by 
Pasch 1882. 

The discovery of the formal treatment of logic, i.e. of the possibility 
of describing deductive reasoning with sentences in terms of their form, 
appears with Aristotle ( 384—322 B.C.). Again there have been modern 
refinements. 

We use both discoveries when we formalize a mathematical theory. 
To do so fully rigorously, it is practically necessary to reconstruct the 
theory in a special symbolic language, i.e. to symbolize it. Instead of 
carrying out the steps described above on the theory as we find it in some 
natural word language, such as Greek or English, we build a new symbolic 
language specially for the purpose of expressing the theory. The natural 
word languages are too cumbersome, too irregular in construction and 
too ambiguous to be suitable. (The symbols in a symbolic language will 
usually correspond to whole words instead of to letters; and sequences 
of symbols which correspond to sentences will be called "formulas”.) 

This new language will be of the general character of the symbolism 
which we find in mathematics. In algebra we perform deductions as formal 
manipulations with equations, which would be exceedingly tedious to 
perform in ordinary language, as some of them were before the invention 
by Vieta ( 1591 ) and others of the modern algebraic notations. The dis¬ 
covery of simple symbolic notations which lend themselves to manip¬ 
ulation by formal rules has been one of the ways by which modern mathe¬ 
matics had advanced in power. However the ordinary practice in mathe¬ 
matics illustrates only a partial symbolization and formalization, since 
part of the statements remain expressed in words, and part of the de¬ 
ductions are performed in terms of the meanings of the words rather 
than by formal rules. 

Since Leibniz ( 1666 ) conceived his idea of a universal characteristic, 
formal logic also has been receiving a symbolic treatment, with the aid 
of mathematical techniques, under De Morgan (1847, 1864), Boole (1847, 
1854), Peirce (1867, 1880), Schroder (1877, 1890-1905) and others. 

These concurrent developments have finally led to formalizations of 
portions of mathematics, in the strict sense, by Frege (1893, 1903), 
Peano (1894-1908) and Whitehead and Russell (1910-13). (The method 
of making a theory explicit which we have been describing is often called 
the logistic method.) 

To Hilbert is due now, first, the emphasis that strict formalization of 
a theory involves the total abstraction from the meaning, the result 
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being called a formal system or formalism (or sometimes a formal theory or 
formal mathematics ); and second, his method of making the formal 
system as a whole the object of a mathematical study called meta¬ 
mathematics or proof theory. 

Metamathematics includes the description or definition of formal 
systems as well as the investigation of properties of formal systems. 
In dealing with a particular formal system, we may call the system the 
object theory, and the metamathematics relating to it its metatheory. 

From the standpoint of the metatheory, the object theory is not 
properly a theory at all as we formerly understood the term, but a system 
of meaningless objects like the positions in a game of chess, subject to 
mechanical manipulations like the moves in chess. The object theory 
is described and studied as a system of symbols and of objects built up 
out of symbols. The symbols are regarded simply as various kinds of rec¬ 
ognizable objects. To fix our ideas we may think of them concretely as 
marks on paper; or more accurately as abstracted from our experience 
with symbols as marks on paper. (Proof theory must be to some extent 
abstract, since it supposes arbitrarily long sequences of symbols to be 
constructible, although the quantity of paper and ink in the world is 
finite.) The other objects of the system are analyzed only with regard 
to the manner of their composition out of the symbols. By definition, 
this is all that a formal system shall be as an object of study for meta¬ 
mathematics. 

The metatheory belongs to intuitive and informal mathematics (unless 
the metatheory is itself formalized from a metametatheory, which here 
we leave out of account). The metatheory will be expressed in ordinary 
language, with mathematical symbols, such as metamathematical 
variables, introduced according to need. The assertions of the meta¬ 
theory must be understood. The deductions must carry conviction. They 
must proceed by intuitive inferences, and not, as the deductions in the 
formal theory, by applications of stated rules. Rules have been stated to 
formalize the object theory, but now we must understand without rules 
how those rules work. An intuitive mathematics is necessary even to 
define the formal mathematics. 

(We shall understand this to mean that the ultimate appeal to justify 
a metamathematical inference must be to the meaning and evidence 
rather thdn to any set of conventional rules. It will not prevent us in 
practice from systematizing our metamathematical results in theorems 
or rules, which can then be applied quasi-formally to abbreviate the in¬ 
tuitive reasoning. This is a familiar procedure in informal mathematics. 
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We shall sometimes even refer to principles of (intuitionistic) logic 
stated formally, when the formal derivation of those principles indicates 
the method by which the reasoning can be carried out informally.) 

The methods used in the metatheory shall be restricted to methods, 
called finitary by the formalists, which employ only intuitively con¬ 
ceivable objects and performable processes. (We translate the German 
“finit” as "finitary”, since the English "finite” is used for the German 
"endlich”.) No infinite class may be regarded as a completed whole. 
Proofs of existence shall give, at least implicitly, a method for con¬ 
structing the object which is being proved to exist. (Cf. § 13 .) 

This restriction is requisite for the purpose for which Hilbert introduces 
metamathematics. Propositions of a given mathematical theory may fail 
to have a clear meaning, and inferences in it may not carry indubitable 
evidence. By formalizing the theory, the development of the theory 
is reduced to form and rule. There is no longer ambiguity about what 
constitutes a statement of the theory, or what constitutes a proof in the 
theory. Then the question whether the methods which have been formal¬ 
ized in it lead to contradiction, and other questions about the effect of 
those methods, are to be investigated in the metatheory, by methods not 
subject to the same doubts as the methods of the original theory. 

The finitary methods are of sorts used in intuitionistic elementary 
number theory. Some formalists attempt to circumscribe them still 
more narrowly (Hilbert and Bernays 1934 p. 43 , and Bernays 1935, 
1938 ). 

We shall leave the discussion of this until later (§ 81 ). For the purpose 
of defending classical mathematics against the intuitionists, there is no 
need to use less than the intuitionists would allow. However it is natural 
to proceed on the basis of strictly elementary methods so long as they will 
suffice. All the examples of intuitionistic number-theoretic reasoning 
given in § 13 we shall take to be finitary. We shall find that up to a late 
stage in our metamathematical investigations, intuitionistic methods of 
an entirely elementaly sort will suffice. The ultimate test whether a 
method is admissible in metamathematics must of course be whether it 
is intuitively convincing. 

(Some authors use "meta-” to identify a language or theory in which 
another language or theory is made the object of a study not restricted 
to finitary methods. Also “syntax language” vs. "object language” is 
used in this connection. Cf. Carnap 1934; also cf. § 37 . In this book, we 
only use "meta-” when the methods are finitary.) 

The formal systems which are studied in metamathematics are (usually) 
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so chosen that they serve as models for parts of informal mathematics 
and logic with which we are already more or less familiar, and from which 
they arose by formalization. The meanings which are intended to be 
attached to the symbols, formulas, etc. of a given formal system, in con¬ 
sidering the system as a formalization of an informal theory, we call the 
(i intended ) interpretation of the system (or of its symbols, formulas, etc.). 
In other words, the interpretations of the symbols, formulas, etc. are the 
objects, propositions, etc. of the informal theory which are correlated 
under the method by which the system constitutes a model for the in¬ 
formal theory. 

In the case of a formula which represents an ideal statement of clas¬ 
sical mathematics (§ 14 ), the interpretation cannot constitute a wholly 
intuitive (or finitary) meaning, but must consist in whatever else it is the 
classical mathematician thinks in terms of in the informal (or not strictly 
formalized) development of classical mathematics, i.e. in the development 
which has taken place historically and takes place currently, when the 
procedure is not being consciously formalized in the strict sense of proof 
theory. 

The interpretation motivates the metamathematician in his choice of 
the particular formal system which he introduces by his definitions. 
It guides him in choosing the problems relating to the system which he 
investigates. It may even provide him with essential clues toward achiev¬ 
ing the solution of those problems. Only in the final statement and proof 
of his results is he prohibited (as a metamathematician) from using the 
interpretation. 

How restrictive is this prohibition? Metamathematics must study the 
formal system as a system of symbols, etc. which are considered wholly 
objectively. This means simply that those symbols, etc. are themselves 
the ultimate objects, and are not being used to refer to something other 
than themselves. The metamathematician looks at them, not through 
and beyond them; thus they are objects without interpretation or 
meaning. 

Now in studying those objects, metamathematics must bring to bear 
its own methods and tools. These may be any that are finitary. For 
example, metamathematics may employ the natural numbers in a 
finitary way. In the case of formulas admitting (outside of metamathe¬ 
matics) a finitary interpretation, it may be possible within metamathe¬ 
matics to define properties of those formal objects which (from outside 
the metamathematics) are equivalent to their interpretations. Thus the 
finitary interpretations may be brought in through the back door. But 
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metamathematics cannot in any way deal with the non-finitary inter¬ 
pretations of the ideal propositions of classical mathematics. 

In order to make it clear all along why we are interested in the formal 
systems which we are considering, and how they constitute formalizations 
of portions of logic and mathematics that we are already familar with 
informally, we shall in this book indicate the possibilities of interpretation, 
and use suggestive terminology, such as “proof” for formal derivations, 
and “and” for the name of the symbol &. This is necessary to our full 
purpose, even though the interpretation is extraneous to the meta¬ 
mathematics itself. 

Let us briefly recapitulate. In the full picture, there will be three sep¬ 
arate and distinct “theories”: (a) the informal theory of which the formal 
system constitutes a formalization, (b) the formal system or object theory, 
and (c) the metatheory, in which the formal system is described and 
studied. 

Here (b), which is formal, is not a theory in the common sense, but a 
system of symbols and of objects built from symbols (described from (c)), 
which however forms a kind of conventionalized image or model for (a). 
On the other hand, (a) and (c), which are informal, do not have an 
exactly determined structure, as does (b). 

Then (c) is a theory with (b) as its subject matter, which must apply 
to (b) without looking at (a), or rqore precisely without looking at the 
interpretation of (b) in terms of (a). 

Furthermore (c) is restricted to the use of finitary methods, while in 
general (a) will not be. 


Note 1: At the top of p. 51, the seeming circularity that not B is used in explaining 
not A is to be avoided thus. Sameness and distinctness of two natural numbers (or 
of two finite sequences of symbols) are basic concepts (cf. p. 51 lines 20-24). For any 
B of the form m — n where m and n are natural numbers, not B shall mean that m 
and n are distinct. The explanation of not A in lines 5-8 then serves for any A other 
than of that form, by taking the B in it to be of that form. Equivalently, since the 
distinctness of 1 from 0 is given by intuition (so not 1=0 holds), not A means that 
one possesses a method which, from any proof of A, would procure a proof of 1 = 0 
(cf. lines 8-9). 




PART II 

MATHEMATICAL LOGIC 




Chapter IV 


A FORMAL SYSTEM 

§ 16. Formal symbols. We shall now introduce a particular formal 
system. The system described in this chapter will be subject matter for 
the four following chapters and parts of later chapters. The system 
constitutes a formalization of a portion of classical elementary number 
theory including the logic required for it. 

In setting up the system, we have made use of Hilbert and Ackermann 
1928, Hilbert and Bernays 1934, 1939, Gentzen 1934-5, Bernays 1936, 
and less immediate sources. 

Our undertaking has two distinct aspects. First, the formal system itself 
must be described and investigated, by finitary methods and without 
making use of an interpretation of the system. This is the metamathe¬ 
matics. Second, an interpretation of the system must be recognized, under 
which the system does constitute a formalization of number theory. 

One approach would stress the second aspect. We could analyze existing 
informal mathematics, selecting and stereotyping fundamental concepts, 
presuppositions and deductive connections, and thus eventually arrive 
at a formal system. 

Here instead, we shall place the initial emphasis on the first aspect. 
The formal system will be introduced at once in its full-fledged com¬ 
plexity, and the metamathematical investigations will be pursued with 
only incidental attention to the interpretation. The reader is asked to 
concentrate on learning precisely what the formal system is, and how it 
is investigated. The interpretation and the reasons for the choices made 
in setting up this particular system will then gradually unfold as we 
proceed. 

The first step in setting up the formal system is to list the formal 
symbols. The list of formal symbols is analogous structurally to the alpha¬ 
bet of a language, although under the interpretation many of the formal 
symbols correspond to entire words and phrases rather than to single 
letters. The list of the formal symbols follows. 

Logical symbols: D (implies), & (and), V (or), -1 (not), V (for all), 
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3 (there exists). Predicate symbols : = (equals). Function symbols'. + 
(plus), • (times), ' (successor). Individual symbols: 0 (zero). Variables', a, 
b, c, .... Parentheses: (, ). 

The words shown parenthetically may be used in reading the symbols, and 
are intended to suggest the interpretations in a preliminary way, e.g. the 
interpretations of the logical symbols as ‘logical constants’. The variables are 
interpreted as ranging over the natural numbers. An infinite list or enumera¬ 
tion of the variables is supposed to be at hand (potentially, cf. § 13 ). 

We reiterate that the interpretations are extraneous to the description 
of the formal system as such. It must be possible to proceed regarding 
the formal symbols as mere marks, and not as symbols in the sense of 
symbols for something which they symbolize or signify. It is supposed 
only that we are able to recognize each formal symbol as the same in 
each of its recurrences, and as distinct from the other formal symbols. 
In the case of the variables this must include our being able to recognize 
a symbol which is a variable to be such. 

The formal symbols constitute the first category of formal objects. 
We derive from this a second category by constructing finite sequences 
(of occurrences) of formal symbols, which we call formal expressions. 
The word “occurrence” is used here to refer to the members of the se¬ 
quence in their status as members, and to emphasize that different 
members may be the same symbol (which agrees with our previous use 
of the term 'sequence', e.g. §§ 1, 2). The formal expressions include those 
consisting of a single (occurrence of a) formal symbol. Except when stated, 
the empty sequence (with no member) will not be included. For example, 
0, (a)+(b), (a)=(0) and (( 0 V 00 = are formal expressions. The last 
consists of seven (occurrences of) symbols, i.e. it has seven members; 
the third, fifth and sixth (occurrences of) symbols in it are each an 
(occurrence of) 0; and the (distinct) symbols which occur in it are (, 0, 
V, =. The formal expressions are analogous structurally to the words of 
a language; but under the interpretation some of them correspond to 
entire sentences, e.g. (a) — ( 0 ), and others are without significance, e.g. 
(( 0 V 00 =. Again our terminology belies the fact that, for the formal 
system as such, the expressions express nothing, but are only certain 
recognizable and distinguishable objects. 

We shall also use, as a third category of formal objects, the finite 
sequences of (occurrences of) formal expressions. 

In discussing the formal objects we shall often, instead of exhibiting 
them, represent (i.e. denote) them by letters introduced for the purpose, 
or by expressions involving letters already so introduced. For example, 
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the letter “s” might be used to represent the formal expression (a)-\-(b), 
and “A” to represent ( a)—Q . Further illustrations will appear presently. 

Letters and expressions so used are not formal symbols and expressions, 
but informal or metamathematical symbols and expressions, which stand 
as names for formal objects. In this, as compared with ordinary informal 
uses of symbolism, there is the new feature that the objects named are 
themselves symbols or objects constructed from symbols. We have thus 
a distinction to preserve between two kinds of symbolism, the formal 
symbolism about which we are speaking, and the intuitive or metamathe¬ 
matical symbolism in which we are speaking about the other. Differences 
in the kinds of type which will be used for the two purposes (a, b, t, x, 
<C?, 2 vs. a, b, t, x, A, B) will assist in keeping the matter straight. 

The use of symbols and expressions to name the objects we are talking 
about should not be considered as novel; our everyday method for 
constructing a sentence about an object requires this. What is novel, 
rather, is the other procedure, which we use somewhat in our meta¬ 
mathematics, of incorporating the object itself, i.e. a specimen of the 
object, directly into the sentence. Although this violates the usual canons 
of grammatical propriety, it is unambiguous when we are engaged in 
metamathematics. For in metamathematics we must treat the formal 
symbols as meaningless, and therefore the formal objects cannot serve as 
names for other objects, and a sentence containing a specimen of a formal 
object can only be about the formal object itself. 

These remarks apply to our metamathematics. In an occasional pas¬ 
sage, concerned with the interpretation and so labeled for the reader, we 
may give the formal symbols an informal status, treating them then as 
meaningful. 

In our metamathematical study of the formal expressions, we shall 
make use of the operation of juxtaposition (or concatenation ), in which 
two or more sequences of formal symbols are combined consecutively 
to produce a new sequence. For example, the juxtaposition of the two 
formal expressions ((0V00== and (a)-{-(b) in that order produces the new 
formal expression (( 0 V 00 =(<z) + (/>); an d the juxtaposition of the seven 
formal expressions (, (a)+(b), ), •, (, ( c)', ) in the given order produces 
the new formal expression ((<*)+(£))•((<:)'). 

When some of the formal expressions to be juxtaposed are being rep¬ 
resented by metamathematical letters or expressions, these latter may 
appear in place of the formal expressions which they represent in writing 
the result of the juxtaposition. For example, if the letter “s” represents 
some formal expression, the result of the juxtaposition of the seven formal 
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expressions (, s,), *, (, (c)', ) is written “(s)-((c)')”. Here "(s)-((c)')” is a 
metamathematical expression representing a formal expression, the formal 
expression represented depending on what formal expression the letter 
“s” represents. In particular, if s is (<*) + (/>), then (s)-((c)') is 
(W+( 4 )H(C)'). 

§ 17. Formation rules. We shall now define certain subcategories 
of the formal expressions, by definitions analogous to the rules of syntax 
in grammar. 

First we define ‘term’, which is analogous to noun in grammar. The 
terms of this system all represent natural numbers, fixed or variable. 
The definition is formulated with the aid of metamathematical variables 
"s” and “t”, and the operation of juxtaposition, as explained above. It 
has the form of an inductive definition, which enables us to proceed from 
known examples of terms to further ones. 

1. 0 is a term. 2 . A variable is a term. 3 — 5 . If s and t are terms, then 
(s)-j-(t), (s)-(t) and (s)' are terms. 6. The only terms are those given 
by 1 — 5 . 

Example 1. By 1 and 2 , 0, a, b and c are terms. Then by 5 , (0)' and 
(c)' are terms. Applying 5 again, (( 0 )')' is a term; and applying 3 , ((c)') + (a) 
is a term. 

We now give a definition of ‘formula’, analogous to (declarative) 
sentence in grammar. 

1. If s and t are terms, then (s) = (t) is a formula. 2 — 5 . If A and B 
are formulas, then (A) D (B), (A) & (B), (A) V (B) and -i(A) are for¬ 
mulas. 6— 7 . If x is a variable and A is a formula, then Vx(A) and 3 x(A) 
are formulas. 8. The only formulas are those given by 1 — 7 . 

Example 2. Using 1 and the examples of terms already obtained, 
( a) — {b) and ({(c)')-\-{a)) = (b) are formulas. Then using 5 and 7 , 
“i (( a)=(b )) and 3 c((((c)') + (<*)) =(b)) are formulas. Finally by an 
application of 2, the following is a formula: 

(A) (M((M')+M)=(*))> 3 (-1 <(«)=(*))). 

The inductive definitions of term and formula have the consequence 
that each term or formula can be built up from 0 and variables by a series 
of steps, each of which steps corresponds to a direct clause of one of those 
definitions (§ 6), and may be called an application of that clause. 

Each step, except an application of 1 or 2 of the definition of term, is 
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§ 17 

of the following kind. At the start we have given an expression or pair 
of expressions previously obtained. We enclose the given expression or 
each of the given expressions in parentheses, and introduce an ex¬ 
pression of one of the ten forms 
(B) D, &, V, -i, Vx, 3 x, =, +, 

where x is a variable. Let us call an expression of one of these ten forms 
an operator. In particular, D, &, V, -i are propositional connectives, and 
operators of the forms Vx and 3 x are quantifiers, Vx being a universal 
and 3 x an existential quantifier ; these six are logical operators. 

The given expression or pair of expressions we call the scope of the 
operator in the resulting expression. By following through the entire 
construction of a term or formula, establishing in the obvious way a 
correspondence between the parts of the given expression or pair of 
expressions and parts of the resulting expression at each step, we are led 
to an assignment of a scope, not merely to the operator last introduced in 
the completed term or formula, but to every operator in that term or 
formula. 

Example 3. In the formula (A) the scope of the first occurrence of 
= consists of the part ((c)')+(<*) and the first occurrence of b, and the 
scope of the 3 c is the part (((c)') + (tf))=(/>)• 

We now state the following fact, the rigorous proof of which we shall 
consider in a moment. In a given term or formula, the scopes of the 
operators can be recognized without ambiguity from the arrangement of 
the parentheses. In other words, the parentheses make it possible, given 
the term or formula as a finite sequence of formal symbols, to recover 
all essential details of its construction under the inductive definitions of 
term and formula. 

The rigorous proof of this fact is afforded by Lemma 2 of § 7 Example 2, 
together with the following lemma which can be proved by induction 
from the inductive definitions of term and formula. 

Lemma 4. In a given term or formula, there exists a proper pairing of 
the parentheses {which are 2 n in number, n being left parentheses and n being 
right parentheses) such that the scope of each operator occurs as follows. 

(a) For operators having one expression as scope, the scope is immediately 
enclosed within paired parentheses, and the operator stands immediately 
outside this pair of parentheses, i.e. immediately to the left of the left paren¬ 
thesis [in the case of -i, Vx, 3 x) or immediately to the right of the right 
parenthesis {in the case of '). 
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(b) For operators having two expressions as scope {namely D, &, V, 
= , + , •), each of the two expressions is immediately enclosed within paired 
parentheses, and the operator stands immediately between the right parenthesis 
of the pair enclosing the left expression and the left parenthesis of the pair 
enclosing the right expression. 

Example 3 (concluded). The displayed example (A) of a formula 
contains 22 parentheses. By Lemma 4, the 22 parentheses admit a 
proper pairing, which is discovered in the process of constructing the 
formula under the definitions of term and formula, and which indicates 
the scopes of the operators. Knowing that there exists a proper pairing, 
by Lemma 2 that pairing is unique, and can therefore be discovered by 
the algorithm of § 7, without prior knowledge of the construction of the 
formula under the definitions of term and formula. We actually did so 
at the end of § 7, where we examined the same 22 parentheses without 
looking at the intervening symbols. Using the resulting pairing of the 
22 parentheses as they occur within the complete formula, we can see 
that the scope of the first occurrence of = consists of the expression 
enclosed by the parentheses ( 3 )J° and the expression enclosed by the 
parentheses (J 1 )* 2 . This agrees with our previous identification of that 
scope. Similarly, the scope of 3c is enclosed by the parentheses (g )g 3 . 

Lemma 3 of § 7, while not necessary to the proof that the scopes can 
be discovered from the arrangement of the parentheses, is useful in 
reasoning about the scopes in parts and the whole of a term or formula. 
For example, if M, N and A are formulas, and A occurs as a (consecutive) 
part, not the whole, of (M) 3 (N), we can infer that this part (or each 
such part) is either a part of M or a part of N. 

In choosing our definitions of term and formula, we of course provided 
the parentheses for the above described purpose of indicating the scopes 
unambiguously. Now evidently more parentheses will usually be in¬ 
troduced under the definitions than are strictly necessary for the purpose. 
Leaving the definitions as they stand, we can agree to omit superfluous 
parentheses as an abbreviation in the writing down of terms and formulas, 
or of metamathematical expressions representing them. 

The possibilities in this direction are extended by employing conven¬ 
tions of a sort familiar from algebra, where “a-h+c” is understood to 
mean (a-t>)+c. We say here that + ranks ahead of •, and rank our 
operators in the order in which we have listed them at (B) above. To restore 
any parentheses which are left out in abbreviating a term or formula, 
one may proceed step by step, each time selecting an operator which of 
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those present comes earliest in the list, i.e. an operator of highest rank, 
and giving it the greatest scope compatible with the requirement that 
the whole be a term or formula. 

We shall not always omit the maximum number of parentheses which 
our convention would allow, but aim at securing maximum readability. 
(With this aim, we also sometimes alter parentheses to square brackets 
or curly brackets.) 

Example 4. Restoring parentheses to “A D BVC&D” gives suc¬ 
cessively “AD(BVC&D)”, “A D ((B V C) & D)”, "(Ap(((B)V(C)) 
& (D))”. We abbreviate the displayed example (A) of a formula as follows: 

(A') 3c(c'-\-a=b) D -ia=b. 

Another kind of abbreviation is afforded by introducing a new symbol, 
with a method for translating an expression containing the new symbol 
back into one without it. For example, we abbreviate the terms (0)', 
((0)')', (((0)')')', ... as“l”,"2”,“3”, .... respectively; and we abbreviate 
the formula -i a—b as “a #/>”, and the formula 3c(c'-\~a=b) as 
"a<b". The displayed formula (A) can then be written: 

(A”) a<b"Da^b. 

The general rule for the abbreviation “ i=" allows us to write “s#t” 
as abbreviation for -is=t whenever s and t are terms. The general rule 
for the abbreviation “<” allows us to write “set” as abbreviation for 
3x(x'-|-s=t) whenever x is a variable and s and t are terms not con¬ 
taining x. In unabbreviating, when the introduction of the abbreviation 
has suppressed a variable, as in the case of “<”, there is an ambiguity 
respecting the variable to be supplied. Thus in unabbreviating “set”, 
we may choose as the x any variable which s and t do not contain. This 
ambiguity is of minor consequence, since the statements we shall wish to 
make about the formula abbreviated will hold regardless of what ad¬ 
missible variable is chosen. 

We shall regard all this abbreviation as merely in the exposition of the 
metamathematics. This is adequate for our purposes, and thereby we 
keep the fundamental definitions, which establish the formal system, 
theoretically simpler. Metamathematical statements about terms and 
formulas of the system are hence to be understood to refer to the un¬ 
abbreviated expressions in the literal sense of the definitions, whatever 
shorthand we may employ in writing the statements. 
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§ 18. Free and bound variables. An occurrence of a variable x 
in a formula A is said to be bound (or as a bound variable), if the occurrence 
is in a quantifier Vx or 3x or in the scope of a quantifier Vx or 3x (with 
the same x); otherwise, free (or as a free variable). 

Example 1 . In 3 c(c'+a=b) 'D -ia=b, both occurrences of a and 
both occurrences of b are free, and both occurrences of c are bound. In 
3 c{c'-\-a=b) 3 ~ya=b-\-c, the first two occurrences of c are bound 
and the third is free. In 3c(3c(c'+d=^) 3 —ta=b-\-c) all occurrences 
of c are bound. 

We also say that any occurrence of a variable x in a term t is free, as 
will follow from the above definition if applied reading “term t” instead 
of "formula A”. The distinction between a free and a bound occurrence 
of a variable is always relative to the term or formula in which it is (at 
the moment) being considered as an occurrence. 

Example 2. The third occurrence of c in 3c(3c(c'+<x=i>) 3 -\a—b-\-c) 
is free when considered as an occurrence in the part c taken by itself or c' 
by itself or c'+a by itself or c'-^a—b by itself, and bound as an oc¬ 
currence in 3 c(c'+a=b) by itself or 3 c(c’+a=b) 3 -i a=b+c by itself 
or in the whole formula. 

A variable x which occurs as a free variable (briefly, occurs free) in A 
is called a free variable of A, and A is then said to contain x as a free 
variable (briefly, to contain x free) ; and likewise for bound variables. 

Example 3. The free variables of 3 c(c'+a=b) 3 -i a=b+c are a, b 
and c, and the only bound variable is c. 

A bound occurrence of a variable x in a formula A is bound by that 
particular one, of the quantifiers Vx or 3x (with the same x) in the 
scope of which it lies, which has the least scope (briefly, by the innermost 
quantifier in whose scope it lies), or in case it is an occurrence in a quan¬ 
tifier Vx or 3x, by that quantifier itself (or the latter binds the former). 

Example 4. In 3c(3c(c'+a=i>) 3 -i a=b+c) the first and fourth 
occurrences of c are bound by the first quantifier 3c, and the second 
and third occurrences of c by the second quantifier 3c. 

In building up a formula under the definitions of term and formula, 
a given bound occurrence of a variable in the resulting formula is bound by 
that one of the quantifiers whose introduction first converted it from a 
free to a bound occurrence (or if it is a variable in a quantifier, by the 
quantifier in which it is introduced). 
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Example 5. Compare Example 4 with Example 2 . 

A few preliminary remarks are offered now on the interpretation of 
free and bound variables (sometimes called ‘real’ and ‘apparent’ variables). 
The remarks are of course not part of the metamathematics, but should 
help to explain the adoption of the metamathematical discriminations. 
An expression containing a free variable represents a quantity or prop¬ 
osition depending on the value of the variable. An expression containing 
a bound variable represents the result of an operation performed over the 
range of the variable. Our bound variables are associated with the logical 
operations of quantification, but examples occur with other sorts of 
operations familiar to mathematicians. In the following n and y are free, 
i and x are bound: 

n rV 

(A) 2 a,-, lim f(x, y), I f{x, y) dx. 

r = 1 a ;—~y 

In the following the occurrence of t as upper limit of the integral is free 
and the occurrences in the integrand are bound: 

(B) ft (t) dt. 

To go a little further with the interpretation, we may note some 
characteristic differences which it imposes on the way we may use the 
two kinds of variables in informal mathematics. A bound variable forms 
part of a circumlocution for expressing the result of an operation carried 
out over the range of the variable, and one can hence change the variable 
to any other having the same range without altering the meaning (subject 
to certain precautions). For example, 

(C) 2 a s , lim f(z, y), ff(t,y)dt 

7 = 1 z —>0 , '—y 

would (ordinarily) mean the same as the respective expressions (A) 
above (but lim f(y, y) is not (usually) the same as lim f(x, y)). If in an 

y —>0 x —>0 

expression we substitute for a free variable an expression representing 
a constant or variable object from its range, we (ordinarily) obtain a 
meaningful result, while such a substitution would result in nonsense if 
applied to a bound variable. For example (substituting in (A)), 

5 j'Z 

(D) 2 a f , lim f(x, 2), / f{x, z) dx 

1 = 1 £“>0 •'-Z 
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are (ordinarily) significant expressions, but not 

(E) S« 5 , lim f(2, y), ff(0,z)d0. 

5 = 1 2—>0 •'-* 

When the same variable occurs both free and bound in an expression, 
the quantity represented by the expression depends only on the value 
of that variable in its free occurrences. Thus the integral (B) is a function 
of t, whose value for t — 3 is 

3 3 

(F) f f{t) dt, not f/(3) dZ. 

■'o •'o 

Substitution. In stating the metamathematical definitions of the 
next section, we shall use an operation of substitution, which we define 
as follows. The substitution of a term t for a variable x in (or synonymously, 
throughout) a term or formula A shall consist in replacing simultaneously 
each free occurrence of x in A by an occurrence of t. To describe this in 
juxtaposition notation, let n be the number of free occurrences of x in 
A (« ^> 0); and write A as "A 0 xA x x. . .A n _ x xA„” showing these oc¬ 
currences (A 0 , A 1( ..., A n _ x , A„ being parts possibly empty containing 
no occurrence of x free relative to A as a whole, and all the n occurrences 
of x shown being free). Then the result of the substitution of t for x in 
A is A 0 tA x t.. ,A n _ x tA n . 

A compact metamathematical notation will be useful in representing 
the result of a substitution. If substitution is to be performed for x, we 
first introduce a composite notation such as “A(x)” for the substituend, 
showing its dependence on x after the manner of notation for functions in 
mathematics (§ 10). The result of substituting t for x in A(x) is then 
written “A(t)'\ 

Example 6. Let x be c, and 
A(x) or A(c) be 3c{c'~\-a—b) 3 -la—b+c. 

Then A(0) is 3c(c'+a—b) 3 ~\a—b-\-0, 
and A (a) is 3c(c'-f(j=/’) 3 -ta=b+a. 

Example 7. Let x be a, and A(x) be a-\-c—a. Then A(0) is0+c^=0, 
and A (b) is b+c—b. 

The substitution which gives A(t) must always be performed for the 
original variable x in the original formula A(x), i.e. for the variable 
and in the formula for which the notation “A(x)” is first introduced. 

Example 7 (concluded). For the above x and A(x), A(c) is c+c=c. 
If we substitute b for c in A(c), we obtain b-\-b—b. This is not the same 
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as A (b), which we obtained above correctly by substituting b for a in A (< 2 ), 
i.e. for the original x in the original A(x). (The same difficulty can occur 
by the misuse of the notation for a function in informal mathematics.) 

We have not required that the variable x actually occur as a free 
variable in A(x). When x is not a free variable of A(x), the result A(t) 
of the substitution is the original expression A(x) itself. 

Similarly, we define substitution performed simultaneously for a 
number of distinct variables; and we shall employ like notations, such as 
"A(x lf .. x„)” for the substituend, and “A(t 1( ..., t n )” for the result. 

Henceforth we shall often introduce these composite notations, such 
as “A(x)'’ or "A(x x ,..., x n )” instead of "A”, when we are interested in the 
dependence of A on a variable x or variables x 1( .. .,x„, whether or not 
we are about to make a substitution. For example, we usually designate 
a formula by “A(x)” instead of "A”, when we want to use it in VxA(x) 
(read "for all x, A of x”, or briefly "all x, A of x”) or 3xA(x) (read "there 
exists an x such that A of x”, or briefly "exists x, A of x’’). We repeat 
that by using “A(x)” (or "Afo,..., x,,)”) we do not imply that x (or 
each of x lt ..., x n ) necessarily occurs free in the formula designated. 

The preliminary remarks on the interpretation shed light on why we 
have elected to define our metamathematical substitution operation as 
applying only to the free occurrences of the variables. 

We now say that a term t is free at the free occurrences of a variable x 
in a formula A(x) (or t is free at the substitution positions for x in A(x), or 
briefly t is free for x in A(x)), if no free occurrence of x in A(x) is in the 
scope of a quantifier Vy or 3y where y is a variable of t (i.e. occurs in t). 

Example 8 . The terms d, d-{- 0' and a d are free for a in the first 
but not in the second of the following formulas: 

(I) 3c(c'-{-a=b) & -\d=0, 3d{d'+a=b) &-id=0. 

Under this definition, when t is free for x in A(x) and only then, the 
substitution of t for x in A(x) will not introduce t into A(x) at any place 
where a (free) variable y of t becomes a bound occurrence of y in the 
result A(t). 

Example 8 (concluded). Substituting d+0' for a in (I) gives 

(II) 3c(c'+(d+0')=b)&-id=0, 3d(d'-\-(d-\-0')—b) & -i</=0, 

respectively. In the first of these, the d of the occurrence of d+0' in¬ 
troduced by the substitution remains free in the whole formula, but not 
in the second. 
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We say that the substitution of t for x in A(x) is free, when t is free for 
x in A(x). With only the smattering of interpretation indicated above, 
it should be clear that a substitution is inappropriate when it is not free. 

The two formulas in (I) mean the same; but the two in (II) do not. 

For an informal example, consider the second expression of (A) or (C). 
This stands for a function of y, call it 

(G) f(y) = Urn f(x, y) = lim f{z, y). 

x —>0 z ~~“>-0 

The value of f(y) for y — z is then given properly by 

(H) f(z) = lim f(x, z), not by f(z) = lim f(z, z). 

x — >0 z — >-0 

Example 9. To illustrate the handling of the terminology and 
notations explained in this section, say that x is (i.e. “x” denotes) a 
variable, A(x) is (i.e. “A(x)” denotes) a formula, and b is (i.e. “b” denotes) 
a variable such that (i) b is free for x in A(x) and (ii) b does not occur free 
in A(x) (unless b is x). Under our substitution notation, since “x” and 
"A(x)” are introduced first, (iii) A(b) is (by definition) the result of 
substituting b for (the free occurrences of) x in A(x). By (i), the occur¬ 
rences of b in A(b) which are introduced by this substitution are free. By 
(ii), there are no other free occurrences of b in A(b). Thus the free oc¬ 
currences of b in A(b) are exactly the occurrences introduced by the 
substitution. Hence (inversely to (i) — (iii)): (iv) x is free for b in A(b), 
(v) x does not occur free in A(b) (unless x is b), and (vi) A (x) is (in fact) 
the result of substituting x for (the free occurrences of) b in A(b). To make 
this example particular, 

x, A(x), b, A(b) 

may be respectively, 

c, 3c{c'+a=b) 3 -\a=b-\-c, d, Jc{c'-\-a=b) 3 -\a—b-\-d. 

§ 19. Transformation rules. In this section we shall introduce 
further metamathematical definitions (called deductive rules or trans¬ 
formation rules) which give the formal system the structure of a deductive 
theory. To emphasize the analogy to an informal deductive theory, we 
shall start with a list of 'postulates’; however, for the metamathematics, 
these are not postulates in the sense of assumptions, as indeed they cannot 
be when officially they have no meaning, but only formulas and forms 
(or schemata) to which we shall refer when we give the definitions. 
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Before giving the postulate list, let us illustrate the types of postulates 
which will appear in the list. The simplest is an ‘axiom’, of which -\a'=Q) 
is an example. This is a formula of the formal system. Then we may have 
an ‘axiom form' or ‘axiom schema’, of which “B D A V B” is an example. 
This is a metamathematical expression, which gives a particular axiom 
each time formulas are specified as represented by the metamathematical 
letters ‘‘A” and “B”. For example, when A is a'=0 and B is -ia’=0, 
we obtain the axiom —>a'=0 D a'=0 V —ia'=0. The axiom schema 
is thus a metamathematical device for specifying an infinite class of 
axioms having a common form. 

We must also have another kind of postulates, which formalize the 
operations of deducing further theorems from the axioms. These are the 
'rules of inference’, of which the following is an example: 

A, A3B 

B. 

This is a schema containing three metamathematical expressions "A”, 
‘ADB” and ‘‘B”, which represent formulas whenever formulas are 
specified as represented by the metamathematical letters “A” and “B”. 
The sense of the rule is that the formula represented by the expression 
written below the line may be ‘inferred’ from the pair of formulas re¬ 
presented by the two expressions written above the line. For example, 
by taking as A the formula -i a !=0 and as B the formula a !—0 V -i<z'=0, 
the rule allows the inference from -ij'=0 and 
-ui'=0 D(j'=0Via'=0 to a'=0 V —\a'=0. Since -\a'=0 and 
-la’—0 D a'=0 V -\a'—0 are axioms (as we just saw), a'=0 V -ia'=0 
is a further 'formal theorem'. (Our terminology will include the axioms 
as theorems.) 

We shall now display the full postulate list, and then give the def¬ 
initions establishing the deductive structure of the formal system by 
referring to the list. The reader may verify that the cumulative effect 
of the series of definitions will be to define a subclass of the class of for¬ 
mulas called ‘provable formulas’ or ‘formal theorems’. 

Postulates for the formal system 

Dramatis personae. For Postulates 1—8, A, B and C are formulas. 
For Postulates 9—13, x is a variable, A(x) is a formula, C is a formula 
which does not contain x free, and t is a term which is free for x in A(x). 
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Group A. Postulates for the predicate calculus. 

Group Al. Postulates for the propositional calculus. 

la. 

A ID (B DA). 

A, A D B 

2 ‘ B. 

lb. 

(A D B) D ((A D (B D 0) D (A D 0). 

3. 

A D (B D A & B). 

4a. A & B D A. 



4b. A&BDB. 

5a. 

ADA VB. 

6. (A DC) D ((Bo C) 

5b. 

BOA VB. 

D(A VBDQ). 

7. 

(ADB) D((AD -iB) D -.A). 

8°. -i-iAoA. 

Group A2. (Additional) Postulates for the predicate calculus. 

o 

C oA(x) 

10. VxA(x) OA(t). 

7. 

C D VxA(x). 

11 . 

A(t) D 3xA(x). 

A(x) DC 

12 . 

3xA(x) DC. 


Group B. (Additional) Postulates 

for number theory. 

13. 

A(0) & Vx(A(x) D A(x')) D A(x). 


14. 

II 

<3 

n 

II 

15. -i0'=O. 

16. 

II 

n 

C) 

II 

n 

II 

S3 

17. a=bz>a'=b'. 

18. 

a-\- 0 =< 2 . 

19. a+b'= (a+by. 

20 . 

0-0=0. 

21 , d'h *= ci'b a . 


(The reason for writing “°” on Postulate 8 will be given in § 23.) 

One may verify that 14—21 are formulas; and that 1—13 (or in the 
case of 2 ,9 and 12 , the expression(s) above, and the expression below, the 
line) are formulas, for each choice of the A, B, C, or x, A(x), C, t, subject 
to the stipulations given at the head of the postulate list. 

The class of ‘axioms’ is defined thus. A formula is an axiom, if it has 
one of the forms la, lb, 3— 8 , 10 , 11, 13 or if it is one of the formulas 
14—21. 

The relation of ‘immediate consequence’ is defined thus. A formula is 
an immediate consequence of one or two other formulas, if it has the form 
shown below the line, while the other(s) have the form(s) shown above 
the line, in 2, 9 or 12. 

This is the basic metamathematical definition corresponding to 
Postulates 2, 9 and 12, but we shall restate it with additional terminology 
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which draws attention to the process of applying the defini. t on. Postulates 
2, 9 and 12 we call the rules of inference. For any (fixed) choice of the A 
and B, or the x, A(x) and C, subject to the stipulations, the formula(s) 
shown above the line is the premise (are the first and second premise, 
respectively), and the formula shown below the line is the conclusion, 
for the application of the rule (or the {formal) inference by the rule). The 
conclusion is an immediate consequence of the premise(s) (by the rule). 

Carnap 1934 brings the two kinds of postulates under the common 
term 'transformation rules’, by considering the axioms as the result of 
transformation from zero premises. 

The definition of a ‘(formally) provable formula’ or ‘(formal) theorem’ 
can now be given inductively as follows. 

1 . If D is an axiom, then D is provable. 2. If E is provable, and D is an 
immediate consequence of E, then D \s provable. 3. If E and F are provable, 
and D is an immediate consequence of E and F, then D is provable. 4. A 
formula is provable only as required by 1 —3. 

The notion can also be reached by using the intermediate concept of 
a ‘(formal) proof’, thus. A ( formal) proof is a finite sequence of one or more 
(occurrences of) formulas such that each formula of the sequence is 
either an axiom or an immediate consequence of preceding formulas of 
the sequence. A proof is said to be a proof of its last formula, and this 
formula is said to be {formally’) provable or to be a {formal) theorem. 

Example 1. The following sequence of 17 formulas is a proof of the 
formula a=a. Formula 1 is Axiom 16. Formula 2 is an axiom, by an 
application of Axiom Schema 1 a in which the A and the B of the schema 
are both 0=0; and Formula 3 by an application in which the A is 
a—b 3 (a=c 3 b=c) and the B is 0=0 3(0=0 3 0=0). Formula 4 is 
an immediate consequence of Formulas 1 and 3, as first and second 
premise respectively, by an application of Rule 2 in which the A 
of the rule is a=b 3 {a=b tDb=c) and the B is [0=0 3 (0=0 3 0=0)] 3 
[a = b 3 {a=c 3 Z>=c)]. Formula 5 is an immediate consequence of 
Formula 4, by an application of Rule 9 in which the x is c, the A(x) is 
a=b 3 {a=c 3 b=c), and the C is 0=0 3(0=0 3 0=0) (which, note, 
does not contain the x free). Formula 9 is an axiom by an application 
of Axiom Schema 10, in which the x is a, the A(x) is y ib'ic[a=b 3 
(a—c 3 / 7 = 0 )], and the t is a -\-0 (which, note, is free for the x in the 
A(x)). The A(t), by our substitution notation (§ 18), is the result of sub¬ 
stituting the t for (the free occurrences of) the x in the A(x), i.e. here 
the A(t) is V/>Vc[d+0=/> 3 (<z+0=c 3 b=c)]. 
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1. a=b 3 (a=c 3 b—c) — Axiom 16. 

2. 0=0 3 (0=0 3 0=0) — Axiom Schema la. 

3. {a=b 3 (a=c 3 b=c)} 3 {[0=0 3 (0=0 3 0=0)] 3 
[a=b 3 (a=c 3 b=c)]} — Axiom Schema la. 

4. [0=0 3(0=0 3 0=0)] 3 [a=£ 3 (<j=c 3/>=c)]—Rule 2, 1,3. 

5. [0=0 3 (0=0 3 0=0)] 3 Wc[a=b 3 (a=c 3 b=c)] — Rule 9, 4. 

6. [0=0 3 (0=0 3 0=0)] 3 'ib'ic[a=b 3 (a=c 3 ^=c)] — Rule 9, 5. 

7. [0=0 3 (0=0 3 0=0)] 3 VaVbVc[a=b 3 (a=c 3 £=c)] — Rule 9,6. 

8. VaV£Vc[a=£ 3 ( a=c 3 £=c)] — Rule 2, 2, 7. 

9. Va'ib'1c[a=b 3 (a=c 3 £=c)] 3 v£Vc [<j+0=£ 3 (<*+0=c 3 b=c)] 
— Axiom Schema 10. 

10. V£Vc[<j+0=£3(<z+0=c3£=c)] — Rule 2, 8, 9. 

11 . VbVc[a+0=b 3 (a+0=c 3 b=c)] 3 Vc[a+0=a 3 (<j+0=c3 a=c)] 
— Axiom Schema 10. 

12. Vc[<H-0=a3(a+0=c3d=c)] — Rule 2, 10, 11. 

13. Vc[0+O=<*3(*+O=c3tf=c)] 3[*+0=tf 3(<i+0=d3d=<j)] — 
Axiom Schema 10. 

14. d - ) _ 0=<j 3 (<3 ~j - 0^= ^2 3 fl=<z) — Rule 2, 12, 13. 

15. <z+0=<j — Axiom 18. 

16. a+0=a3a=a — Rule 2, 15, 14. 

17. a=a — Rule 2, 15, 16. 


Example 2. Let A be any formula. Then the following sequence of 
five formulas is a proof of the formula A 3 A. (In other words, what we 
exhibit below is a ‘proof schema’, which becomes a particular proof on 
substituting any particular formula, such as 0=0, for the metamathe- 
matical letter “A”; and its last expression “A 3 A” is accordingly a 
‘theorem schema’.) Formula 1 is an axiom, by an application of Axiom 
Schema la in which the A and the B of the schema are the A of this 
example. Formula 2 is an axiom, by an application of Axiom Schema lb 
in which the A and the C of the schema are the A of this example, and 
the B of the schema is the A 3 A of this example. Formula 3 is an imme¬ 
diate consequence of Formulas 1 and 2, as first and second premise, 
respectively, by an application of Rule 2 in which the A of the rule 
is the A 3 (A 3 A) of this example, and the B of the rule is the 
[A 3 ((A 3 A) 3 A)] 3 [A 3 A] of this example. 
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1 . AD (A 3 A) — Axiom Schema la. 

2. {A 3 (A 3 A)} 3 {[A 3 ((A 3 A) 3 A)] 3 [A 3 A]} — Axiom Sche- 
ma lb. 

U 3. [A3((A3A) 3 A)] 3 [A 3 A] — Rule 2, 1, 2. 

4 . A 3 ((A 3 A) 3 A) — Axiom Schema la. 

5. A 3 A — Rule 2, 4, 3. 

The terms proof, theorem, etc. as defined for the formal system (i.e. 
formal proof, formal theorem, etc.) must be sharply distinguished from 
these terms in their ordinary informal senses, which we employ in pre¬ 
senting the metamathematics. A formal theorem is a formula (i.e. a cer¬ 
tain kind of finite sequence of marks), and its formal proof is a certain 
kind of finite sequence of formulas. A metamathematical theorem is a 
meaningful statement about the formal objects, and its proof is an 
intuitive demonstration of the truth of that statement. 

We mentioned three categories of formal objects (§ 16), but we shall 
be free to introduce others in the study of them, so long as the treatment 
is finitary. Besides this, a somewhat different extension of our subject 
matter occurs when we discuss the form of our metamathematical def¬ 
initions and theorems in turn. If we chose to be meticulous in our way 
of doing this, it would constitute a metametamathematics. However, the 
same practice is common in (other branches of) informal mathematics; 
and we shall regard such discussions as incidental explanations, intended 
sometimes to make it easier to grasp quickly what is being done in the 
metamathematics, and sometimes to enable us to condense the statement 
of metamathematical theorems which could be stated without them. 
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FORMAL DEDUCTION 

§ 20. Formal deduction. Formal proofs of even quite elementary 
theorems tend to be long. As a price for having analyzed logical de¬ 
duction into simple steps, more of those steps have to be used. 

The purpose of formalizing a theory is to get an explicit definition of 
what constitutes proof in the theory. Having achieved this, there is no 
need always to appeal directly to the definition. The labor required to 
establish the formal provability of formulas can be greatly lessened by 
using metamathematical theorems concerning the existence of formal 
proofs. If the demonstrations of those theorems do have the finitary 
character which metamathematics is supposed to have, the demonstrations 
will indicate, at least implicitly, methods for obtaining the formal proofs. 
The use of the metamathematical theorems then amounts to abbrevia¬ 
tion, often of very great extent, in the presentation of the formal proofs. 

The simpler of such metamathematical theorems we shall call derived 
rules, since they express principles which can be said to be derived from 
the postulated rules by showing that the use of them as additional 
methods of inference does not increase the class of provable formulas. 
We shall seek by means of derived rules to bring the methods for es¬ 
tablishing the facts of formal provability as close as possible to the in¬ 
formal methods of the theory which is being formalized. 

In setting up the formal system, proof was given the simplest possible 
structure, consisting of a single sequence of formulas. Some of our derived 
rules, called ‘direct rules’, will serve to abbreviate for us whole segments 
of such a sequence; we can then, so to speak, use these segments as 
prefabricated units in building proofs. 

But also, in mathematical practice, proofs are common which have 
a more complicated structure, employing ‘subsidiary deduction’, i.e. 
deduction under assumptions for the sake of the argument, which as¬ 
sumptions are subsequently discharged. For example, subsidiary de¬ 
duction is used in a proof by reductio ad absurdum, and less obtrusively 
when we place the hypothesis of a theorem on a par with proved propo- 
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sitions to deduce the conclusion. Other derived rules, called ‘subsidiary 
deduction rules', will give us this kind of procedure. 

We now introduce, by a metamathematical definition, the notion 
of ‘formal deducibility under assumptions’. Given a list D 1 ,...,Dj 
(l > 0) of (occurrences of) formulas, a finite sequence of one or more 
(occurrences of) formulas is called a {formal) deduction from the as¬ 
sumption formulas D x ,..., D { , if each formula of the sequence is either 
one of the formulas D x ,.... D,, or an axiom, or an immediate consequence 
of preceding formulas of the sequence. A deduction is said to be a de¬ 
duction of its last formula E; and this formula is said to be deducible 
from the assumption formulas (in symbols, D 1( .. .,D, j- E), and is called 
the conclusion (or endformula) of the deduction. (The symbol “ [-” may 
be read “yields”.) 

The definitions of deduction and of deducibility are generalizations of 
those of proof and of provability (which they include as the case for 
l = 0) to permit the use of any formulas D x ,..., D, we please, called as¬ 
sumption formulas for the deduction, as pro tempore on a par with the 
axioms. 

Example 1. Let A, B and C be formulas. Then the following se¬ 
quence of five formulas is a deduction of C from the three assumption 
formulas A D (B D C), B and A. (We exhibit a ‘deduction schema’.) 

1. B — second assumption formula. 

2. A — third assumption formula. 

(2) 3. Ad(BdC) — first assumption formula. 

4. B3C — Rule 2, 2, 3. 

5. C — Rule 2, 1, 4. 

Example 2. Let the reader construct: (3) a deduction of A & B 
from A and B; (4) a deduction of C from A & B 3 C, A, B. 

By an analysis of a deduction or proof A x ,..., A k , we mean a spec¬ 
ification, for each j (j — 1, ...,&), either that A s is one of the assumption 
formulas and which one in the list D 1( ..., D,, or that A f is an axiom and 
by which axiom schema or particular axiom of the postulate list, or that 
A, is an immediate consequence of preceding formulas and by which rule 
of inference and of which preceding formulas as the respective premises 
of that rule. In brief, an analysis of a deduction consists of the expla¬ 
nations employed to justify each occurrence of a formula in it (i.e. in our 
examples, the explanations given at the right of the formulas). 

It may occasionally happen that an occurrence of a formula in a de¬ 
duction (or proof) can be justified in more than one way, e.g. the formulas 
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A, B and C might be such that one of the five formulas in ( 2 ) is an axiom. 
Consequently, for some of the discussions below, the procedure to be 
applied to a given deduction is only determined uniquely, when along 
with the deduction itself there is given a particular analysis of it. 

It is to be emphasized that the expression “D 1 ,...,D J [- E”, which 
we use to state briefly that E is deducible from D 1( .... D;, is not a formula 
of the system, but a brief way of writing a metamathematical statement 
about the formulas E, namely the statement that there 

exists a certain kind of a finite sequence of formulas. When 1 = 0, the 
notation becomes “ b E”, meaning that E is provable. The symbol 
" b” goes back to Frege 1879 ; the present use of it to Rosser 1935 * and 
Kleene 1934 *. 

Example 3. The following two statements ( 1 ') and ( 2 ') have been 
justified by exhibiting above the two deductions ( 1 ) and ( 2 ), respectively; 
and (3') and (4') by Example 2. 

(T) [-ADA. (2') AD(BDC), B, Ah C. 

(3') A, B b A & B. (4') A & B D C, A, B b C. 

Notice that the symbol “ b” appears in context preceded by a finite 
sequence of zero or more formulas and followed by a single formula (or 
instead of formulas, metamathematical letters or expressions representing 
formulas). This makes unambiguous the scope of an occurrence of the 
symbol " b” m a metamathematical sentence. In particular, the scopes 
of the formal operators are necessarily confined within formulas of the 
system, while “ b” is a metamathematical verb lying outside any 
formula of the system. 

The definition of ‘deducible from D x ,..., D,’ can also be stated without 
using the intermediate concept of a deduction (cf. the first definition of 
‘provable’ in § 19). We leave it to the reader to state the five clauses 
required. Briefly, “D 1 ; ..D t bE” then means that it is possible to get 
from (zero or more of) the formulas D x , ..., Dj and (zero or more) axioms 
by the rules of inference to the formula E. The two versions of the def¬ 
inition are brought into agreement by observing that, when the formulas 
considered in the process of getting from D 1; ..Dj and axioms to E are 
put down in order of first consideration, we have a deduction of E from 
Dj,..., D;. 

We shall use Greek capital letters, such as ‘T”, “A”, “0”, etc., to stand 
for finite sequences of zero or more (occurrences of) formulas, when we 
wish to indicate sets of assumption formulas without naming the formulas 
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individually (or sometimes 'T(x)”, "A(x 1 ,...,x n )” ) etc., when we wish 
also to emphasize certain variables which may occur in them). 

From the definition of the deducibility relation b, there follow several 
general properties of b which can be seen to be true without reference to 
what particular postulates are in the postulate list for the formal system, 
(i) r b E when E is in the list F. (ii) If F j- E, then A, F f E for any A. 
In particular, we can regard any provable formula as deducible from 
any assumption formulas we please, (iii) If F f- E, then A f- E where A 
comes from F by permuting the formulas F, or omitting any which are 
duplicates of others remaining, (iv) If F b E, then A j- E where A comes 
from r by omitting any of the formulas F which are provable or deducible 
from those remaining. For, given a deduction of E from F, we can obtain 
one from A by inserting into the given deduction, in place of each oc¬ 
currence of an assumption formula which we wish to suppress, a de¬ 
duction of the same from the remaining assumption formulas. These four 
general properties can be analyzed into the simpler ones of the following 
lemma. However (while the inferences we make by general properties of 
b can actually all be made from (i)—(iv) or (I)~ (V)), the reader is en¬ 
couraged to reason flexibly with b on the basis of its meaning. 

Lemma 5. (I) E b E. (II) If F b E, then C, F b E. (HI) If C, C, F b E, 
then C, T b E. (IV) If A, I), C, F b E, then A, C, I), F f E. (V) If A | C 
and C, F b E, then A, T b E. (After Gentzen 1934-5.) 

Example 4. If A b B and A, B, C b F> and B, I) b E, then A, C b E. 
The reader may convince himself of this directly from the meaning of 
b (under both versions of its definition), and also verify that it follows 
from (i) — (iv) and from (I) — (V). 

The definition which we have given for b is relative to a particular 
formal system as determined by a postulate list. Specifically, it is relative 
both to the part of the postulate list which determines the axioms, and 
to the rules of inference. Thus far we have been concerned only with the 
one formal system, but we shall make use of b in like sense in connection 
with other formal systems, e.g. subsystems of that one obtained by con¬ 
sidering only part of the postulate list to be in force for determining the 
class of axioms and the relation of immediate consequence. We shall 
always understand b to be relative to the formal system we are studying 
at the given time. 

Notice that A, F b E for a given system is equivalent to A j- E lot 
the system resulting from the given one by adding the formulas F to the 
set of axioms. 
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§ 21. The deduction theorem. We shall consider the following 
theorem first for the propositional calculus, i.e. with only the postulates 
of Group A1 in force. 

Theorem 1 . For the propositional calculus, if T, A |- B, then T (- A D B. 
(The deduction theorem.) 

Proof. The hypothesis of the theorem says that there is a finite 
sequence of formulas such that: each formula of the sequence is either 
(a) one of the formulas T, (b) the formula A, (c) an axiom, or (d) an im¬ 
mediate consequence by Rule 2 of two preceding formulas (since Rule 2 
is the only rule of inference here); and the last formula of the sequence 
is the formula B. This sequence we shall call the ‘given deduction’ of B 
from T, A. 

The conclusion of the theorem says that there is a finite sequence of 
formulas such that: each formula of the sequence is either (a) one of the 
formulas T, (c) an axiom, or (d) an immediate consequence by Rule 2 of 
two preceding formulas; and the last formula of the sequence is the 
formula A ZD B. This sequence we shall call the ‘resulting deduction’ of 
A D B from T. 

The theorem will be proved by a course-of-values induction on the 
length k of the given deduction (§ 7), taking the B of the theorem to be 
variable, but the P, A fixed for the induction. 

The induction proposition P(k) or P(P, A, k) is: For every formula B, 
if there is given a deduction of B from T, A of length k, then there can he found 
a deduction of A D B from P. 

Basis (to prove the proposition for k = 1, i.e. to prove P(r, A, 1)). 
Suppose given a formula B and a deduction of B from T, A of length 1. 
We distinguish three cases, according to which of the possibilities (a)—(c) 
applies to the last (and since k — 1, only) formula B of the given deduc¬ 
tion. The possibility (d) is excluded here, since B is the only formula. 

For each case, we show how to construct the resulting deduction, 
leaving it to the reader to verify that the sequence of formulas which 
we submit as such does have the required features. 

Case (a): B is one of the formulas T. Then the following sequence of 
formulas is the resulting deduction. 

1. B — one of the formulas T. 

2. BD(AdB) — Axiom Schema 1 a. 

3. A D B — Rule 2, 1, 2. 
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Case (b): B is A. The resulting deduction is the sequence of formulas 
(1) which was given in § 19 Example 2 as a proof of A 3 A. Since B is A, 
the formula A 3 A is A 3 B. 

Case (c): B is an axiom. The resulting deduction is the same as in 
Case (a), except that now the first step is justified on the ground that 
B is an axiom. 

Induction step. Assume (as hypothesis of the induction) that, for 
every / <; k, P(T, A, l ); i.e. that for every l < k and every B, if there is 
given a deduction of B from T, A of length l, then there can be found a 
deduction of ADB from T. Now (to prove P(T, A, k + 1)) suppose 
given a> formula B and a deduction of B from T, A of length k + 1. We 
distinguish four cases, according to which of the possibilities (a) — (d) 
applies to the last formula B of the given deduction. The treatment of 
Cases (a) — (c) is the same as under the basis. 

Case (d): B is an immediate consequence by Rule 2 of two preceding 
formulas. By the statement of Rule 2, we may call these two formulas 
P and PDB. (We use the letter P now, instead of A as in the statement 
of the rule,, since A is reserved here to designate the last assumption 
formula for the given deduction.) If we discard the part of the given 
deduction below the formula P, the part remaining will be a deduction 
of P from T, A of length l <L k. By the hypothesis of the induction 
(with P as its B), we can hence find a deduction of A 3 P from I\ Likewise, 
applying the hypothesis of the induction to the part of the given de¬ 
duction down to PDB inclusive, we obtain a deduction of A 3 (P 3 B) 
from T. We use these two deductions (say they are of lengths p and q, 
respectively) in constructing the resulting deduction, as follows. 

deduction of A 3 P from I\ given by the 
. D f hypothesis of the induction. 

P' A ZD IT 

deduction of A 3 (P 3 B) from T, given by 

, , | the hypothesis of the induction. 

p+q. A3(?3B) J j 

p+q+ 1. (A 3 P) 3 ((A 3 (P 3 B)) 3 (A 3 B)) — Axiom Schema lb. 
£+ 2 + 2 . (A 3 (P 3 B)) 3 (A 3 B) — Rule 2, p, p+q+ 1. 

P~\~q~\~3. A 3 B — Rule 2, P~{~q, P~\~q~\'^ t 

This completes the proof of the theorem by mathematical induction. 
The theorem includes the case that T is empty: For the propositional 
calculus, if Ah B, then |- A 3B. 
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Example. We were able to state, as (2') above, that Ad(B 3C), B, 
A (- C. By Theorem 1, we can thence infer that A D (B D C), B \- A D C. 

To examine this example more closely, let us take the deduction (2), 
which we exhibited in justification of (2'), as the given deduction of C 
from AD(BdC), B, A. By referring to the proof of Theorem 1, we 
should be able to find the resulting deduction of ADC from Ad (BdC), B. 
Since the given deduction is of length > 1, and the last formula comes 
from preceding formulas by an application of Rule 2, the case which 
applies is Case (d) under the induction step. There we find some details, 
and instructions to find the rest by applying Theorem 1 to the deductions 
1 and 1 —4 occurring as parts of (2). Continuing in this manner, we 
eventually obtain the following as the resulting deduction. 

1. B — second assumption formula. 

2. Bd(AdB) — Axiom Schema la. 

3. ADB - Rule 2, 1,2. 

4. Ad (AD A) — Axiom Schema la. 

5. {A D (A D A)} D {[A D ((A D A) D A)] D [A D A]} — Axiom Sche¬ 
ma lb. 

6. [A D ((A D A) D A)] D [A D A] — Rule 2, 4, 5. 

7. AD ((AD A) DA) — Axiom Schema la. 

8. ADA — Rule 2, 7, 6. 

(5) 9. A D (B D C) — first assumption formula. 

10. {A D (B D C)} D {A D (A D (B D C))} — Axiom Schema la. 

11. AD(AD(BDC)) — Rule 2, 9, 10. 

12. {A D A} D {[A D (A D (B D C))] D [A D (B D C)]> — Axiom Sche¬ 
ma lb. 

13. [A D (A D (B D 0)] D [A D (B D C)] — Rule 2, 8, 12. 

14. A D (B D C) — Rule 2, 11, 13. 

15. (A D B) D ((A D (B D C)) D (A D C)) — Axiom Schema lb. 

16. (A D (B D 0) D (A D C) — Rule' 2, 3, 15. 

17. ADC — Rule 2, 14, 16. 

The deduction (5) is not the only deduction of A D Cfrom A D (B D C), B. 
It happens that there is a shorter one, which we obtain from (5) by omit¬ 
ting Formulas 4—8 and 10—14, and citing 9 (instead of 14) as first 
premise for the inference by Rule 2 at Step 17. 

But (5) is the particular one which results by the method used in 
proving Theorem 1 when (2) is taken as given deduction. We have 
carried through the exercise of finding (5) to emphasize the finitary 
character of the reasoning used in proof of Theorem 1, and in particular 
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to show what is involved in the use of mathematical induction. Hence¬ 
forth we shall be satisfied to know that the resulting deductions exist and 
could be found. 

The proof of Theorem 1 will serve as a model for metamathematical 
proofs of certain types. In the future we shall often give such proofs 
in a more abbreviated way, when the reader could cast the argument into 
explicit applications of induction. A few proofs will be set up fully ex¬ 
plicitly as models. 

The above proof of Theorem 1 can be given in a more abbreviated 
way as follows. To each formula of the given deduction of B from T, A, 
let A 3 be prefixed. (In the example, from Formulas 1, 2, 3, 4, 5 of (2), 
we thus obtain Formulas 3, 8, 11, 14, 17 of (5).) The resulting sequence 
of formulas (with A 3 B as end-formula) is not (in general) a deduction 
from T, but can be made one by inserting additional formulas in the 
manner indicated in the treatment of the cases. (This simple plan of proof 
will be modified slightly, when we come to extend the theorem to the 
predicate calculus in § 22.) 

From A 3 (B 3 C), B b A 3 C we infer by a second application of 
Theorem 1 that A 3 (B 3 C) b B 3 (A 3 C). A convenient arrangement 
of these inferences is the following. 

1. A 3 (B 3 C), B, A b C — (2). 

(5') 2. A3(B3C), B b A3C — Theorem 1,1. 

3. A 3 (B 3 C) b B 3 (A 3 C) — Theorem 1, 2. 

In this presentation, we have a sequence of expressions analogous to, but 
on a different level from, the sequence of formulas which constitutes a 
formal proof or deduction. The expressions in this sequence are meta¬ 
mathematical statements about the formal system, while in a formal proof 
or deduction they are formulas of the system. 

Another example of a deduction and of a series of metamathematical 
statements follows. 

1. A & B — second assumption formula. 

2. A & B 3 A — Axiom Schema 4a. 

3. A — Rule 2, 1, 2. 

(6) 4. A 3 (B 3 C) — first assumption formula. 

5. B 3 C — Rule 2, 3, 4. 

6. A & B 3 B — Axiom Schema 4b. 

7. B — Rule 2, 1, 6. 

8. C — Rule 2, 7, 5. 
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1 . AD(BdC), A&B b c — (6). 

1 ' 2. AD(BDC) b A&B DC — Theorem 1, 1. 

As further examples, the reader may establish: 

(7') A & B D C b A D (B D C) — cf. (4') § 20. 

1. ADB, B DC b A DC. 

2. ADB b (B DC) D (A DC) — Theorem 1, 1. 

§ 22. The deduction theorem (concluded). Theorem 1 is a derived 
rule of the subsidiary deduction type (cf. § 20). For an application of the 
rule, the given deduction of B from T, A is the subsidiary deduction ; and 
the deduction of ADB from T obtained from the given deduction by 
the method indicated in the proof of the theorem we have called the 
resulting deduction. When we are stating the existence of deductions 
without actually exhibiting them, we may adopt an elliptical phraseology, 
speaking for example of “the deduction T, A |- B”, when we mean the 
deduction which the statement ‘T, A b B” asserts to exist. 

In Theorem 1, the last assumption formula A of the subsidiary de¬ 
duction T, A b B is not used in making up the list of assumption formulas 
for the resulting deduction T b A D B; accordingly we say this (occurrence 
of A as) assumption formula of the subsidiary deduction is discharged. 
(There might also be occurrences of A in the list T, which would not be 
discharged.) 

In general, a subsidiary deduction rule is a metamathematical theorem 
which has one or more hypotheses of the form A,- b E,- called the sub¬ 
sidiary deductions, and a conclusion of the form A b E called the resulting 
deduction. From each of the subsidiary deductions, one or more as¬ 
sumption formulas may be discharged. 

Example 1. The rule "If T, A b C and T, B b C, then T, A V B b C”, 
which will be established in the next section, has two subsidiary de¬ 
ductions, T, A b C and T, B b C, from the first of which the last as¬ 
sumption formula A is discharged, and from the second the B. 

A metamathematical theorem of the simple form A b E is a derived 
rule of the direct type. It says that it is possible to proceed from the 
formulas A and the axioms directly to E by applications of the rules 
of inference. 

There is the following important difference between these two kinds 
of derived rules. A direct rule necessarily remains true when the formal 
system is enlarged by adding new axioms and rules of inference, since 
the rule states simply that certain deductions can be constructed, and 
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the new postulates only change the situation by providing additional 
means of constructing those same deductions. But a subsidiary deduction 
rule does not necessarily remain true when new postulates are added, 
since the enlargement of the system tends to create new instances of the 
subsidiary deductions, and it becomes a question whether resulting de¬ 
ductions exist to correspond to these new subsidiary deductions. Most 
of the subsidiary deduction rules which we shall state (in particular, all 
those of the present chapter) have an ambiguous set of assumption 
formulas T before the symbol “ h” throughout, so that the addition of 
new axioms can cause no trouble. But the addition of new rules of in¬ 
ference will create new cases to be considered in the proof of the rule. 

We shall next treat Theorem 1 under the condition that all the 
postulates of Group A are in force, either exactly these, or also the ones of 
Group B (which are only an axiom schema and axioms). A certain 
restriction will be required in order to handle the new cases in the proof. 

It seems easier to give this treatment now, while the proof of Theorem 
1 for the propositional calculus is fresh; but some readers may prefer to 
postpone the remainder of this chapter, excepting the parts of § 23 
referring to the propositional calculus, until after Chapter VI. 

We begin by stating some definitions which are useful in formulating 
the restriction. Given a deduction A x ,..., A* from assumption formulas 
D 1( ..., D { and a particular analysis of the deduction (§ 20), we define 
when (an occurrence of) a formula A, in the deduction ‘depends’ on a given 
one D, of the (occurrences of) assumption formulas D x ,.. .,D ( , as follows. 

1. If A, in the given analysis is D }) then A,- depends on D,. 2. If A <x l£ 
depends on D,, and A< in the given analysis is an immediate consequence 
of A (i (or of A,- x and some A u , in either order), then A< depends on D } . 

3. A,- depends on D, only as required by 1 and 2. 

It is easily seen that A< depends on D 3 , if and only if there exists no 
subsequence of the deduction (not necessarily consecutive) which under 
the given analysis constitutes a deduction of A,- from the remaining 
assumption formulas D 1( ..., D,_ x , D m ,..., D,. 

Example 2. In the deduction (6), Formulas 4, 5 and 8 depend on 
the first assumption formula AD(BDC), and the other formulas do not. 
Formulas 1, 6, 7 (with the given analysis) constitute a deduction of 7 
from the other assumption formula A & B. 

We now say that a variable y is varied in a given deduction (with a 
given analysis) for a given assumption formula D y , if (A) y occurs free in 
D„ and (B) the deduction contains an application of Rule 9 or Rule 12 
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with respect to y (as the x for the application of the rule) to a formula 
depending on D ; (as the premise for the application of the rule). Otherwise, 
we say that y is held constant in the deduction for the assumption for¬ 
mula Dy. 

Example 3. Let x be a variable, A(x) a formula, and b a variable, 
such that (i) b is free for x in A(x) and (ii) b does not occur free in A(x) 
(unless b is x); and let C be a formula not containing b free. Then the 
following is a deduction of C 3 VxA(x) from C 3 A(b). In verifying that 
the stipulations for Postulates 9 and 10 are met, we use the facts (iv) — (vi) 
which were worked out in Example 9 § 18. 

1. VbA(b) 3 A(x) — Axiom Schema 10 (noting (iv) and (vi)). 

2. VbA(b) 3 VxA(x) — Rule 9, 1 (noting that, by (v), x does not 
occur free in VbA(b)). 

3. C 3 A(b) — assumption formula. 

4. C 3 VbA(b) — Rule 9, 3. 

6. C 3 (VbA(b) 3 VxA(x)) — from 2 as in Case (a) for Theorem 1 § 21. 

9. C 3 VxA(x) — from 4, 6 as in Case (d) for Theorem 1. 

If A(x) contains x free, then in this deduction b is varied, since (using 
(i) and (iii)) the assumption formula C3A(b) contains b free, and Rule 
9 is applied at Step 4 with respect to b to the premise 3, which depends 
on the assumption formula. But x is not varied, since the premise 1 for 
the application of Rule 9 with respect to x at Step 2 does not depend on 
the assumption formula. 

In a given deduction (with a given analysis), a given variable y is 
always held constant for each assumption formula in which it does not 
occur free, while it may be varied for some of the assumption formulas 
in which it occurs free and held constant for others. 

The above terminology suggests itself, since Rules 9 and 12 (the 
“V-rule” and the “3-rule”) are the only two postulates of Group A in 
which a free variable participates as such. Axiom Schema 10, for example, 
can be applied using a free variable as the t, but in that case the variable is 
used in a way that a term not a variable (such as 0) can equally well be 
used. (The employment of free variables in stating the postulates of 
Group B is inessential.) 

The restriction on Theorem 1 for the predicate calculus is that in the 
subsidiary deduction the free variables should be held constant for the 
assumption formula to be discharged. (This will be explained in terms 
of the interpretation in § 32.) 
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Theorem 1 (concluded). For the predicate calculus {or the full number- 
theoretic formal system), if T, A 1- B with the free variables held constant 
for the last assumption formula A, then T 1- A 3 B. 

Proof is obtained from that given in § 21 by supplying the treatment of 
the two additional cases which can now arise under the induction 
step. 

Case (e): B is an immediate consequence of a preceding formula 
by an application of Rule 9. By the statement of Rule 9, that preceding 
formula is of the form C DA(x), where x is a variable, A(x) a formula, 
and C a formula not containing x free. Then B is C 3 VxA(x). We dis¬ 
tinguish two subcases, according as in the given deduction (for a given 
analysis) that preceding formula C 3 A(x) depends on the last as¬ 
sumption formula A or not. 

Subcase (el): C 3 A(x) depends on A. Then A does not contain x free, 
since otherwise the hypothesis that the free variables are held constant 
for A in the given deduction would be contradicted. Since now neither 
A nor C contains x free, the formula A & C does not contain x free. This 
fact is used below in justifying the new application of Rule 9 at Step 
p+q+\. Applying the hypothesis of the induction to the segment of 
the given deduction ending with the formula CDA(x), we obtain a 
deduction of A 3 (C 3 A(x)) from I\ This deduction is incorporated in 
constructing the resulting deduction, as follows. 

deduction of A 3 (C 3 A(x)) from T, 
given by the hypothesis of the in¬ 
duction. 

p. A 3 (C 3 A(x)) 1 deduction of A & C 3 A(x) from 

• * • > A 3 (C 3 A(x)), given by (6'): 2 (end 

P-\-q. A&C3A(x) of §21). 

p-\-q-\- 1. A & C D VxA(x) — Rule 9, p-\-q. 

.., deduction of A 3 (C 3 VxA(x)) from 

A & C 3 VxA(x), given by (7'). 

P-\-q-\-r-\- 1. A3(C3VxA(x)) 

Subcase (e2): C3A(x) (and hence CDVxA(x)) does not depend 
on A. Then some subsequence of the given deduction constitutes a de¬ 
duction of C 3 VxA(x) from the remaining assumption formulas T. 
We use this in constructing the resulting deduction, as follows. 
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deduction of CDVxA(x) from T, given by the 
< hypothesis of independence. 

p. CDVxA(x) 

p-\- 1 . (C DVxA(x)) d(A3 (CdVxA(x))) — Axiom Schema la. 
p+ 2. A3(C DVxA(x)) — Rule 2 , p, p+ 1 . 

Case (f): B is an immediate consequence of a preceding formula 
by an application of Rule 12 . The treatment of this case is similar, using 
(5') :3 twice in the first subcase. 

The deduction theorem was first proved as a derived rule by Herbrand 
1930 . (Cf. also Herbrand 1928 , Tarski 1930 , Church 1932 , Hilbert- 
Bernays 1934 p. 155, Jaskowski 1934 .) 


§ 23. Introduction and elimination of logical symbols. The 

following theorem contains a collection of derived rules, with row and 
column designations attached to provide convenient descriptive names 
for the rules. For example, “VxA(x) (- A(t)” is the rule of “generality 
elimination’’ or briefly “V-elimination”. 

The variable “x” written as superscript on the symbol “ b” in two 
of the rules is to mark the application of Rule 9 or 12 with respect to x 
in constructing the resulting deduction. 


Theorem 2. For the following rules, A, B and C, or x, A(x), C and t, 
are subject to the same stipulations as for the corresponding postulates (§ 19), 
and T or T(x) is any list of formulas. 

For the propositional calculus, the rules hold from “Implication” to 
“Negation”, inclusive. 

For the predicate calculus (or the full number-theoretic system), all the 
rules hold, provided that in each subsidiary deduction the free variables are 
held constant for the assumption formula to be discharged. 


(Introduction) 

(Implication) If T, A b B, 

then T h A D B. 

(Conjunction) A, B \- A & B. 

(Disjunction) A h A VB. 

B b A V B. 


(Elimination) 

A, ADB b B. 

(Modus ponens.) 

A&BbA. 

A & B b B. 

If T, A b C and F, B b C, 
then T, A V B b C. 

(Proof by cases.) 
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(Negation) If T, A h B and T, A |- -iB, 
then r |- i A. 

(Reductio ad absurdum.) 

(Generality) A(x) b x VxA(x). 

(Existence) A(t) b 3xA(x). 


-i -iA b A. 

(Discharge of 
double negation.) 0 

VxA(x) h A(t). 

If r(x), A(x) b c, 
then T(x), 3xA(x) b x C. 


Proofs. The rule of ^-introduction is Theorem 1. There remain 
ten direct and three other subsidiary deduction rules. The direct rules 
may be established by exhibiting the required deductions. The proofs of 
the subsidiary deduction rules are conveniently presented as sequences of 
metamathematical statements (certain of which statements are to be 
substantiated by exhibiting a deduction as in the proof of a direct rule, 
and others of which follow from preceding of the statements by Theorem 
1 or by general properties of b)- In both cases_ appeal is made at some 
point to a corresponding one of the postulates. These proofs are given 
below for several of the rules of each type, the others being left to the 
reader. However here and in similar situations, the reader is urged first 
to attempt himself even those which we give. 

Direct rules. D-elimination. 

1. A — first assumption formula. 

2. ADB — second assumption formula. 

3. B — Rule 2, 1, 2. 

This rule is simply Rule 2 of the postulate list (the "D-rule”, or 
"modus ponens” of traditional logic) restated as a derived rule. 

&-introduction. We already have this as (3') § 20. 

-i-elimination, or discharge of double negation. 

1. -i -i A — assumption formula. 

2. -i-i AD A — Axiom Schema 8. 

3. A — Rule 2, 1, 2. 

V-introduction. Let C be some axiom not containing x free. 

1. A(x) — assumption formula. 

2. A(x) D (C D A(x)) — Axiom Schema la. 

3. C D A(x) — Rule 2, 1, 2. 

4. CDVxA(x) — Rule 9, 3. 

5. C — an axiom. 

6. VxA(x) — Rule 2, 5, 4. 
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Subsidiary deduction rules. V-elimination. 

1 . T, A I- C — hypothesis. 

2 . T |- AdC — Theorem 1 , 1. 

3. T, B h C — hypothesis. 

4. T 1 - BDC — Theorem 1 , 3. 

5. ADC, BdC h AVBdC — using Axiom Schema 6 and Rule 2 . 

6 . AVB, AVBdC |- C — D-elimination (or using Rule 2). 

7. T, AVB h C — 2, 4, 5, 6 . 

3-elimination. 

1 . r(x), A(x) h C — hypothesis. 

2 . T(x) b A(x) 13 C — Theorem 1 , 1 . 

3. A(x) DC h 1 3xA(x) DC — using Rule 12. 

4. 3xA(x), 3xA(x) DC |- C — D-elimination. 

5. T(x), 3xA(x) b x C — 2, 3, 4. 

Discussion. These rules give a classification ot logical operations as 
introductions and eliminations of the logical symbols, adapted from 
Gentzen 1934 - 5 . 

The rule called “V-elimination” does serve to eliminate a disjunction 
symbol, when it is used as follows. 

1 . h AVB — suppose given. 

2. A |- C — suppose given, with the free variables held constant for A. 

3. B 1 - C — suppose given, with the free variables held constant for B. 

4. A V B C — V-elimination (with T empty), 2, 3. 

5. I-C—1,4. 

This process corresponds to the familiar informal method of proof by 
cases: Either A or B. Case 1: A. Then C. Case 2: B. Then C. Hence C. 

Similarly, 3-elimination, used as follows, eliminates an existence 
symbol. 

1 . b 3xA(x) — suppose given. 

2 . A(x) b C — suppose given, where C does not contain x as a free 
variable, and with the free variables held constant for A(x). 

3. • 3xA(x) b C — 3-elimination, 2 . 

4. bC- 1,3. 

This corresponds to the familiar argument: There exists an x such 
that A(x); consider such an x. Then C, which does not depend on x. 
Hence C. 
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Similarly, -(-introduction corresponds to the method of reductio ad 
absurdum. 

Using the T provided for in the theorem, any of these procedures can 
be carried out in the presence of any list of additional assumption formulas. 

The following shows that A, -iA |- B (in words: from a contradiction 
A and -iA, any formula B is deducible). This we shall cite as the rule 
of weak -elimination. 

1. A, -iA, -iB b A. 

2. A, -(A, -iB |- -tA. 

3. A, iA h iiB — -i-introduction, 1, 2. 

4. —i —i B b B — -i-elimination. 

(9') 5. A, -iA (- B — 3, 4, as was to be proved. 

Step 3 amounts to blaming the formula -> B for the contradiction 
A and -i A of 1 and 2. Continuing, we have: 

6. -iA b ADB — D-introduction, 5. 

7. |- nAD(A3B) — D-introduction, 6. 

Our formal system was intended as a formalization of number theory, 
including methods only accepted under the classical viewpoint (cf. § 13). 
However, if Axiom Schema 8 (nAOA) is replaced by the following 
(cf. (9'): 7), all the postulates express principles also accepted by the 
intuitionists (cf. end § 30): 

8 1 . -iAD(ADB). 

In terms of the derived rules of Theorem 2, this means replacing 
-(-elimination by weak -i-elimination. When we wish to consider this 
system also, we call the original system with Postulate 8 the classical 
system, and the system with Postulate 8 1 instead the {corresponding) 
intuitionistic system. Our results are marked with the symbol in every 
case when the demonstration we give is not valid for both systems, but 
only for the classical (and no demonstration which the reader is expected 
to discover for himself is available for the intuitionistic system). 

Use of V-introduction followed by V-elimination gives us the following 
rule. 

Substitution for an individual variable. If x is a variable, A(x) 
is a formula, and t is a term which is free for x in A(x): A(x) b x A(t). 

We shall abbreviate the presentation of applications of our derived 
rules, using tacitly general properties of b- 
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Example 1. Consider the following argument. 

1. A, B h C — suppose this given. 

2. A&B |- A — &-elimination. 

3. A & B 1- B — &-elimination. 

4. A&B b C — 1, 2, 3. 

We condense this as follows. 

1. A, B 1- C — suppose this given. 

2. A&B h C — &-elim., 1. 

Given ‘T 1- P, P b Q” (which means: T b P and P b Q), we 
condense to ‘T b P b Q”; and similarly with longer chains of deductions, 
each of which after the first has as its only assumption formula the 
conclusion of the preceding. (But ‘T |- P, b Q” means: T b P and b Q-) 

Example 2. 

1 . AdB, A b B — 3-elimination. 

2. B b B V C — V-introduction. 

3. ADB, A b B VC — 1, 2. 

We condense this to: 

1. ADB, A h B (- BVC — D-elim., V-introd. 

*§ 24. Dependence and variation. For the predicate calculus, in 
order to use a deduction obtained (i.e. proved to exist) by one of the 
derived rules of Theorem 2 as a subsidiary deduction for a new application 
of one of the rules, we shall need (so far as our information goes) to know 
not only that the deduction exists, but also that the free variables are 
held constant for the assumption formula to be discharged. 

In order to have such information on hand when it is needed, we shall 
make it a practice in applying the rules to keep track of all cases when 
a variable may be varied in the resulting deduction. It is convenient to 
do this by writing any variables which may be varied as superscripts 
on the symbol “b”. This notation is not fully explicit, as it does not 
show for which of the assumption formulas a given superscript variable 
may be varied. We may then simply associate the superscript with the 
assumption formulas in which the variable occurs free. (When there is 
occasion to be more explicit, the facts may be stated verbally, e.g. as in 
Lemma 8a‘below.) 

We recall that, under the definition of variation (§ 22), a variable y 
can be varied only for an assumption formula D, in which it occurs free. 

It is easily seen that, given any deduction D lf ..., D 4 b E, assumption 
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formula D 3 , and variable y that occurs free in D 3 , one can find another 
deduction of E from D x ,..D 2 in which y is varied for D 3 . (Hint: intro¬ 
duce into the deduction some superfluous steps.) Therefore our interest 
will always be in whether there is some deduction Dj,..., D t h E in 
which y is not varied for D 3 ; and our statements to the effect that a 
variable y is varied only for such and such assumption formulas will 
mean that there is some deduction (with the given assumption formulas 
and conclusion) in which this is the case. 

Similarly, given any deduction D 1 ( ..D, h E, it is always possible 
to find another deduction of E from D x ,... Dj in which the conclusion 
E depends on D*. 

The procedure for keeping track of variation is entirely straightforward 
(likewise for dependence). Thus far our only derived rules which call for 
introducing a superscript are V-introduction, 3-elimination and sub¬ 
stitution. (Even then the superscript is not always necessary, e.g. when 
the A(x) for an V-introduction or substitution does not contain x free. 
Also cf. Lemma 7b below.) Moreover, once superscripts have been in¬ 
troduced, we must carry them forward in the obvious way from given de¬ 
ductions to resulting deduction (unless some reason to the contrary can 
be given), both in applying the subsidiary deduction rules of this section, 
and in combining deductions by general properties of (- (§ 20). 

The situations which arise in practice are simple enough so that we 
have little trouble in seeing what is happening. Variables which are being 
varied have usually just previously been introduced in that role for some 
immediate purpose, so that they are not likely to be overlooked. How¬ 
ever, to make the theory of our derived rules complete, the facts are stated 
in more detail in the following lemmas. 

Lemma 6. In Theorem 1 , A D B depends on a given one of the formulas 
T in the resulting deduction T (- A D B, only if B depends on the same one 
in the given deduction T, A |- B. Similarly, in the other subsidiary deduction 
rules of Theorem 2, the conclusion depends on a given one of the T's in the 
resulting deduction, only if the conclusion depends on the same one in the 
given deduction (or in one at least of the two given deductions). 

For otherwise that assumption formula could be omitted from the 
Fs in applying the rule, and afterwards introduced by (II) (and (IV)) of 
Lemma 5. 

In V-elimination, if the C does not depend both on the A in T, A f- C 
and on the B in T, B |- C, the V-elimination can be avoided altogether. 
Similarly in 3-elimination, if the C does not depend on the A(x). 
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Lemma 7a. In Theorem 1 , a variable is varied for a given one of the 
V’sin the resulting deduction T (- A ZD B, only if it is varied for the same one 
in the given deduction F, A [- B. Similarly in the other subsidiary deduction 
rules of Theorem 2, except that for the variable x of 1-elimination the situation 
is as stated in Lemma 7b. 

Lemma 7b. In 1-elimination, the x is varied in the resulting deduction 
r(x), 3xA(x) |- x C only for those of the r(x)’s which contain the x free and 
on which the C depends in the given deduction r(x), A(x) |- C. {In 1-elimi¬ 
nation, no variable is varied for the 3xA(x); and likewise in V -elimination 
for the A V B.) 

These two lemmas may be verified by examining the proofs of Theorems 
1 and 2 . For 3-elimination, if the x is varied in the given deduction for any 
one of the r(x)’s on which the C does not depend, that one of the T(x)’s 
may be omitted for the 3-elimination. 

The discussions of dependence and variation under steps performed 
by general properties of \- (using the list of such properties provided in 
Lemma 5) are left to the reader, excepting that of variation for the A’s 
of (V). Before treating this (in Lemma 9), we prove the following basic 
lemmas. 

Lemma 8 a. If 

(I) D 1( D 2 ,..., Dj [- E, 

where, for j = 1 ,...,/, only the distinct variables y jlt ..., y jp . are varied for 
D ? . {but y^,..., y . p , need not be distinct from the variables y kl , ..., y kPk for 
j ^ k), then 

(II) b Vy u ... Vy lp p 1 3 (Vy a ... Vy 2p D 2 3 ... (Vy u ... Vy Zp Dp E)...); 
and conversely. 

For (II) follows from (I) by V-eliminations and 3-introductions. 
Conversely, (I) follows from (II) by V-introductions and 3-eliminations. 

Lemma 8 b. Given a deduction of E from D x ,.... Dj, another deduction 
of E from D x , ..., D ; can be found in which, for j = 1 ,...,/, Rule 9 is 
applied, to premises dependent on D„ only with respect to variables which are 
varied for D, in the given deduction, and Rule 12 is applied to no premise 
dependent' on an assumption formula. 

We take the given deduction as (I) for Lemma 8 a, pass to (II), and 
thence conversely back to another deduction (I). In this the applications 
of Rules 9 and 12 which come from the proof (II) are to premises dependent 
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on no assumption formulas. Those which come from the V-introductions 
used to obtain (I) from (II) are exactly as described in the present lemma. 

Lemma 9. In (V) of Lemma 5, a variable y is varied for a given one of the 
A ’s in the resulting deduction A, T b E, only (a) if y is varied for the same 
one of the A’s in the first given deduction A (- C, or (b) if y is varied for C 
in the second given deduction C, T (- E, and C depends on that one of the 
A’s in the first given deduction A \- C, and that one of the A’s contains y free. 

Lemma 9 would be immediate, except for the following contingency. 
In the second given deduction C, T |- E there might be an application of 
Rule 9 or 12 with respect to y to a premise dependent on C, and yet y 
be held constant for C because C does not contain y free. If this deduction 
were combined with a deduction A f- C in which C depends on one of the 
A’s containing y tree, then y would be varied for that one of the A’s in the 
resulting deduction A, T |- E. But we can use Lemma 8b to replace the 
given deduction C, T f- E by another, after which the contingency de¬ 
scribed cannot arise for any variable y. 

Hereafter, for new derived rules the facts respecting dependence and 
variation will be as one would expect, with any exceptions noted, and all 
cases when variation may be introduced indicated by superscripts on “ 

In general, in a subsidiary deduction rule having assumption formulas 
for given and resulting deductions in obvious correspondence: The con¬ 
clusion depends on (A given variable is varied for) a given one of the as¬ 
sumption formulas of the resulting deduction, only if it does on (is for) the 
corresponding assumption formula of the given deduction or of either 
given deduction. Examples are Theorems 3 and 4 § 25 (for dependence), 
15 and 16 §34, the formal induction rule §38, 41 (b) and (c) §73, 42 
(III)—(V) and 43 (Vila)—(VUIb) §74, 59 and 60 (b2)—(d) §81. 

Strong V-introduction and 3-elimination. Occasionally it is useful 
to employ V-introduction and 3-elimination in a slightly strengthened 
version, which permits a change in the variable. 

Lemma 10. Let x be a variable, A(x) a formula, and b a variable, such 
that (i) b is free for x in A(x) and (ii) b does not occur free in A(x) (unless 
b is x). Furthermore, for the 3- elimination rule, let C be a formula not con¬ 
taining b free, and let the free variables be held constant for A(b) in the sub¬ 
sidiary deduction. Then : 

A(b) h b VxA(x). If T(b), A(b) f- C, then T(b), 3xA(x) h b C. 

(Strong V-introduction.) (Strong 3-elimination.) 

Proofs. In Example 3 §22 we derived the rule C D A(b) |- b C 3 VxA(x). 
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Using this instead of the postulated rule 9 in our former proof for V- 
introduction (§ 23), we obtain the strong version. 

Derived rules and respective postulates. We call Postulates 
la, lb and 2 the postulates for 3, Postulates 3, 4a and 4b the &c-postulates, 
Postulates 7 and 8 (or for the intuitionistic system 7 and 8 1 ) the -i- 
postulates, etc. Postulates for 3 are used in establishing all the derived 
rules of Theorem 2. 

Lemma 11 . For each selection of one or more of the logical symbols 
3, &, V, -i, V and 3: The rules of Theorem 2 for 3 and those symbols 
(:including for V and 3 the strong versions, and for —i in the intuitionistic 
system the weak -elimination rule in place of the other) hold good in the 
formal system which has as postulates only the 3 -postulates and the postu¬ 
lates for the symbols in question, provided that in case the symbols include 
V but not & the V -postulates include an additional axiom schema as follows, 
where x, A(x) and C are subject to the same stipulations as for Rule 9: 

9a. Vx(C 3 A(x)) 3 (C 3 VxA(x)). 

Proof. This may be verified by a perusal of the above proofs of the 
rules, with an exception in the case the symbols include V but not &, 
since the treatment of Case (e) Subcase (el) of Theorem 1 (§ 22) entails 
use of the &-postulates. With the additional V-schema, however, that 
can be replaced by the following. 

p. A 3 (C 3 A(x)) — as before. 

>+l. A 3 Vx(C3 A(x)) — Rule 9, p. 

p+2. Vx(C 3 A(x)) 3 (C 3 VxA(x)) — Axiom Schema 9a. 

deduction from p+1 and p+2 
given by (8') :1 (end § 21). 

p+q+ 2. A 3 (C 3 VxA(x)). 

Deductions in tree form. We have been taking a deduction to 
be a linear sequence of (occurrences of) formulas. Sometimes it is useful 
instead to consider the (occurrences of) formulas in a partial ordering 
which represents the logical structure directly. In this ordering, the 
premises for each inference are written immediately over the conclusion, 
as in the statement of the rules of inference; and no (occurrence of a) 
formula serves as premise for more than one inference. A deduction (or 
proof) in the former arrangement we say is in sequence form ; in this, in 
tree form. 

The method of converting a deduction of E from T given in sequence 
form, with a given analysis, into one in tree form (called “resolution into 



§24 


DEPENDENCE AND VARIATION 


107 


proof threads” by Hilbert and Bernays 1934 p. 221 ), and inversely, will 
be clear from an example. 

Example 1 . Consider the deduction (6) of §21. By the analysis, 
the bottom formula 8 is an immediate consequence of 7 and 5. Let us 
write 7 and 5 immediately above 8 . Then 7 is an immediate consequence 
of 1 and 6 ; so we write 1 and 6 just over 7; etc. Looking just at the 
numbers 1 — 8 , we obtain the following figure. 

(a) J_ 2 _ 

16 3 4 

7 5 

8 


Writing in the formulas themselves (with new numbers 1 ' — 9' and the 
analysis), we have the deduction in tree form. 

(b) second 

assumption Axiom 

second formula Schema 4a first 

assumption Axiom 4'. A&B 5'. A&B DA assumption 

formula Schema 4b - 2 formula 

1'. A&B 2'. A&B D B 6 '. A 7'. AD(B3C) 

- 2 -—-- -2 


3'. B 


8 '. B DC 


9'. C 


Inversely, from this deduction in tree form of C from A D (B D C) 
and A&B, by arranging the (occurrences of) formulas as a linear se¬ 
quence say in the order of the numbers 1'—9', we obtain one in sequence 
form (not the original one). 

Briefly described, a branch of a deduction in tree form consists of the 
(occurrences of) formulas in linear sequence, passing downward within 
the tree structure, beginning with a formula occurring as an axiom or 
assumption formula, and terminating in the conclusion (or endformula) 
of the deduction. The height of a deduction in tree form is the length of a 
longest branch (or in other words, the number of levels). An (occurrence of) 
a formula is said to be above another (or the latter to be below the former), 
if the former is above the latter in the same branch. 


Example 1 (concluded). The deduction (b) has 5 branches, namely: 
1', 3', 9'; 2 ', 3', 9'; 4', 6 ', 8 ', 9'; 5', 6 ', 8 ', 9'; 7', 8 ', 9'. The height is 4. 
The (occurrence of a) formula 4' is above 8 ' but not above 3'. 



Chapter VI 


THE PROPOSITIONAL CALCULUS 

§ 25. Proposition letter formulas. In this chapter we single out 
for intensive study that part of the formal system which is obtained by 
using only the postulates of Group Al. The meanings of ‘provable’, 
‘deducible’ and ‘ \-’ are to be understood accordingly. 

Under the definition of 'formula’ which was given for the full system 
in § 17 our formulas are all built up in terms of the number-theoretic 
symbolism. But so long as we are using only the postulates of Group Al, 
many details of this symbolism are irrelevant. 

It is undesirable that we should restrict the generality of our treatment 
of the propositional calculus because we intend applying it in the number- 
theoretic system. On the other hand, we must prepare the ground for 
that application. 

We now give, for use in the propositional calculus, an alternative 
definition of ‘formula’, which eliminates the irrelevant details of the 
number-theoretic definition. 

We start by introducing formal symbols of a new kind, 

<3, 2, C, ..., 

called proposition letters, of which we suppose a (potentially) infinite 
list to be available. The new definition of ‘formula’ follows. 

1 . A proposition letter is a formula. 2 —5. If A and B are formulas , 
then (A) 3 (B), (A) & (B), (A) V (B) and “ 1 (A) are formulas. 6 . The only 
formulas are those given by 1 — 5. 

Comparing this with the definition in § 17, Clause 1 of that definition 
is replaced by the new Clause 1 , and Clauses 6 — 7 are suppressed. When 
we wish to distinguish between the two notions of formula, we shall 
call that of § 17 number-theoretic formula, and the present one proposition 
letter formula. 

Example L L7 V (-icT & S) is a proposition letter formula (pa¬ 
rentheses being omitted in continuation of practices established in § 17). 

We henceforth agree, for this chapter, that when we say “formula” 
without specifying a particular sense, the word may be read either in 
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the sense of proposition letter formula, or in the sense of predicate letter 
formula to be defined in the next chapter, or in the number-theoretic 
sense. (It might be read in still other suitable senses. But for definiteness 
we restrict it here to these three, leaving the question whether another 
sense is suitable to be considered when one has another sense in mind.) 

Results stated in this chapter using simply “formula” will thus apply, 
at no extra expense, to any one of three formal systems, having in com¬ 
mon Group A 1 as postulate list, but differing in the sense of formula. 
These three systems we may distinguish respectively as the pure propo¬ 
sitional calculus, the predicate letter propositional calculus and the 
number-theoretic propositional calculus. 

Some of our results, however, will be stated using “proposition letter 
formula”. These will also apply generally. The only difference in the case 
of these is that it is easier to explain them in terms of proposition letters, 
leaving it to the reader to translate them to other senses of formula, when 
he needs to, by means of two general rules for translation which we shall 
next provide (Theorems 3 and 4). 

Let Pj, ..., P m be a list of distinct proposition letters. (Here “Pi”, ..., 
“P m ” are metamathematical letters, used as names for proposition let¬ 
ters when we do not wish to limit our discussion by using particular 
proposition letters.) 

A proposition letter formula A is said to be a proposition letter formula 
in Pj, ..., P m , if no proposition letters other than P x , ,,., P m occur in A. 

Example 2. LZ V (-i<CT & S) is a proposition letter formula in 

a, 8, C. 

Substitution for a proposition letter (or simultaneously for several 
distinct proposition letters) is defined as for a variable in § 18, except 
that it applies to all occurrences without exception (there being here no 
'bound occurrences’). Also, later in the section, we use an operation called 
replacement in all occurrences of a formula (or simultaneously of several 
distinct formulas), defined similarly (the x in the definition of § 18 
becoming a formula); this operation is unambiguous because the oc¬ 
currences will be non-overlapping. 

Theorem 3. Substitution for proposition letters. Let Y be 
proposition letter formulas, and E a proposition letter formula, in the distinct 
proposition letters P x , ..., P m . Let A v .... A m be formulas. Let T* and E* 
result from Y and E, respectively , by substituting simultaneously A 1( ..., A m 1 
for P x , ..., P m , respectively. If Y |- E, then T* h E*. (For the case that Y 
is empty: If b E, then f- E*.) 
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Proof. For the postulates of Group Al, nothing is required of the 
A, B, C which appear in those postulates except that they be formulas 
fixed throughout a given application of a postulate. Now consider the 
given deduction of E from T in the pure propositional calculus. The for¬ 
mulas T and the formula E are proposition letter formulas in the distinct 
proposition letters P x , . ..,P m , and besides these some more P m+1 , 
.... P m+r may occur in other formulas of the deduction. Let A m+1 , .. 
A m+r be any formulas. For each of the proposition letters P 1( ..., P m+r , 
let the respective formulas A 1( ..., A m+r be substituted throughout every 
formula of the given deduction. For each application of a postulate in the 
given deduction, the A, B, C of the application will be transformed by 
the substitution into expressions A*, B*, C* (every occurrence of A 
becoming an occurrence of A*, etc.). These expressions A*, B*, C* will be 
formulas, since to each application of one of Clauses 2 — 5 of the defini¬ 
tion of proposition letter formula used in building up A, B, C from the 
proposition letters P 1( ..., P m+r there will correspond an application of 
the same-numbered clause of the definition of formula used in building 
up A*, B*, C* from the formulas A x , .. ., A m+r . Hence we shall have again 
an application of the same postulate. Thus the sequence of formulas 
into which the given deduction is transformed is again a deduction with 
the same analysis. It is a deduction of E* from T*. 

Example 3. To illustrate the proof of the rule, consider the following 
deduction of 3 D B from B. 

1. B — assumption formula. 

(a) 2. B D (3 D B) — Axiom Schema 1 a. 

3. dDB — Rule 2, 1,2. 

On substituting B, —1<3 & C (or 3 c(a—c'), —\a—0) for <3?, B, we obtain 
the following deduction (b) (or (c)) with the same analysis as (a). 

1. ~& C — assumption formula. 

(b) 2. -i3 & C D (B 3-i(T & C) — Axiom Schema la. 

3. BD-!c3& C — Rule 2, 1, 2. 

1. —\a=0 — assumption formula. 

(c) 2. —\a=0 D (3c(a=c') 3 -uz=0) — Axiom Schema la. 

3. 3c{a=c') Z) -uz=0 — Rule 2, 1, 2. 

To illustrate the application of the rule with the same two examples, 
we know (by reference to (a)) that 

(a') B h 3 D B. 
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The rule allows us to infer, by the substitutions described, 

(b') -ic2&C (- B 

(c') -i<z =0 l- 3 c(a=c') D -ia~0. 

In one case ((a') to (b')), we are using the rule within the pure proposi¬ 
tional calculus, for which it is a derived rule of the subsidiary deduction 
type. In the other case, by substituting formulas in another sense, we 
have used the rule to infer (c') in the number-theoretic propositional 
calculus from (a') in the pure calculus. 

Evidently (a'), (b'), (c'), etc. can all be included in the statement: 
If A and B are formulas, then 

(10') B b ADB. 

The sense of the rule appears as simply that, having established a de¬ 
ducibility relationship in terms of particular proposition letters <3, 
B, C, ..., we can assert the same relationship in the form of a schema with 
metamathematical letters "A”, "B”, “C”,.. . representing any formulas. 

Combining this remark with our earlier observation that a rule of the 
direct form T b E always remains valid in the presence of additional 
postulates (§ 22), we see that all results of the form r (-E obtained in 
this chapter (whether or not stated in terms of proposition letters) will 
hold good for later chapters where we take into account more of the 
structure of the formulas and a larger part of the postulate list of the 
original formal system. 

Remark 1 . Within the pure propositional calculus, substitution can 
be performed for a single variable P, at a time, by using the rule with 
Pi,..., P/_i, Py+i> • • • i Pm 35 the Aj,..., Ay_j, Ay_j_i,...» A m . 

A formula will be said to be prime (for the propositional calculus ), if 
it does not have any one of the forms ADB, A & B, AVB, -i A where 
A and B are formulas. 

Example 4. < 3 = 0 , 3 c{a—c') and Vc(a=c' V a=b) are prime, but 
-i< 3=0 and -ia=0 & 3c(a=c') are not. A proposition letter formula 
is prime, only when it consists simply of a proposition letter. 

From the fact that the scopes of the operators D, &, V, -i in a formula 
can be recognized without ambiguity (§ 17), it follows that any given for¬ 
mula is constructed in a uniquely determined manner out of prime 
formulas by applications of Clauses 2 — 5 of the definition of formula. 
We call the distinct prime formulas out of which a formula or several for- 
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mulas are thus constructed the distinct prime components (for the propo¬ 
sitional calculus) of that formula or set of formulas. 

Example 5. The distinct prime components of a— 0V(-id=0 
& 3 c(a=c')) 3 Vc(a=c' V a=b) are a= 0, 3 c(a—c'), Vc(a=c' V a—b). 

Theorem 4. Converse of substitution for proposition letters. 
Under the same stipulations as in Theorem 3, and provided in addition that 
A x , ..., A m are distinct prime formulas: If T* p E *,thenT (- E. 

Proof. Consider a given deduction of E* from T*. The distinct prime 
components of the formulas T* and E* are the formulas A t , .... A m . 
Other formulas of the deduction may contribute additional distinct 
prime components A m+1 , ..., A m+r . Let P ra+1 , .... P m+r be proposition 
letters. Throughout the formulas of the given deduction, let A x ,.... 
A m+r be replaced simultaneously in every occurrence by P x , ..., P m+r , 
respectively. Consider any postulate application of the given deduction 
of E* from T*. It is readily shown (using Lemma 3 as illustrated in § 17) 
that the replacements will take place within the A, B, C of the application, 
producing proposition letter formulas A', B', C', every occurrence of A 
becoming an occurrence of A', etc. Thus the sequence of proposition letter 
formulas into which the given deduction of E* from T* is transformed by 
the replacements is a deduction of E from P with the same analysis. As 
the method of this proof shows, the converse rule can also be formulated 
thus: 

Theorem 4 (second version). Let P* be formulas and E* a formula 
having as their distinct prime components A 1( A m . Let P x , .. ., P m 
he proposition letters, not necessarily distinct. Let P, E result from T*. E*, 
respectively, by replacing, simultaneously in all occurrences, A x , ..., A m by 
P x , ..., P m , respectively. Then T* (- E* only if T j- E. 

Except when “formula” is read in another sense than proposition 
letter formula, Theorem 4 is included in Theorem 3. 

Example 6 . To illustrate the proof, let -ia—0 3 (3c(<2=c') 3 
-\a—0) occur in the given deduction of E* from P* as an axiom by 
Schema la (as at Example 3 (c) Step 2). Replacing the distinct prime 
components a =0, 3c(a=c') by the proposition letters TT, 3, respectively 
(or both by TT), gives -i<3 3 (S 3-i<3?) (or -icT 3 (TZ 3 -i TZ)), which is 
an axiom by Schema 1 a in the pure propositional calculus. But replacing 
a—0, 3 c(a=c') 3 -ia—0 (the latter not being prime) by cT, 3, or re¬ 
placing the three prime parts a=0 , 3c(a=c'), a—0 (the first and third 
not being distinct) by 37, 3, C, would not give an axiom by Schema 1 a. 
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Example 7. To illustrate the application cf the converse rule, take 
the fact (which we will establish in § 28 Example 3) that c2f V (-i c2f & S) 
is unprovable in the pure propositional calculus. It follows, by the 
converse rule, substituting a= 0, 3 c ( a = c ') for c2f, S, respectively, that 
a =0 V (—\ a =0 & 3 c ( a = c ')) is unprovable in the number-theoretic 
propositional calculus, i.e. this formula of our original system cannot be 
proved on the basis of the postulates of Group A1 only (although in fact 
it is provable using the complete list, §39). But from the unprovability 
of <C7 V (—icT? & 2) in the pure calculus, it does not follow that 
a —0 V (—1<3=0 & —i£Z=0) is unprovable in the number-theoretic propo¬ 
sitional calculus. Why? 

§ 26. Equivalence, replacement. Let A and B be formulas. We use 
"AB” as abbreviation for (AdB)&(BdA). The symbol 
may be read “equivalent”. It functions as a formal operator, which 
placed between two formulas of the system gives another formula of the 
system. In omitting parentheses, it is then ranked ahead of the other 
formal operators (§ 17). 

We say that A is equivalent to B in the propositional calculus or other 
formal system, if in that formal system ? A ~B. Here the word “equiv¬ 
alent” functions as a metamathematical verb, which placed between 
two formulas of the system gives a statement about those formulas. 

Theorem 5. If A, B and C are formulas : 

*1. hADA. *2. ADB, B DC b A DC. 

*3. Ad(BdC) b B D (A DC). 

*4. AD(BDC) b A&BDC. *5. A&B DC b A D(B DC). 

(Principle of identity, chain inference, interchange of premises, 
importation, exportation.) 

*6. A DB b (B DC) D (A DC). *7. A D B b (C D A) D (C D B). 

*8a. A D B b A&CdB&C. *8 b. AdB bC&ADC&B. 

*9a. ADB b AVCdBVC. *9b. AdB bCVADCVB. 

(Introduction of a conclusion, premise, conjunctive member, or 
disjunctive member, into an implication.) 

*10a. -iA b AdB. *10b. A h^ADB. *11 . B ?AdB. 
(Demonstration of an implication by refuting the premise, or by proving 

the conclusion.) 

*12. ADB b ~iB D -iA. *13. Ad-iB b B D -iA. 

*14°. -iA DB b iBDA. *15°. - 1 AD- 1 B bBDA. 

(Contraposition, and contraposition with double negations suppressed.) 
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*16. A DB, B DA h A~B. 

* 17a. A ~B b A DB. *17b. A ~B b B DA. 

*18a. A ~ B, A b B. *18b. A ~ B, B b A. 

(From the definition of ~ in terms of D and &.) 

*19. b A ~ A. * 20 . A ~ B b B 1 —'A. *21. A '—' B, B ~ C b A ~C. 
(Reflexive, symmetric and transitive properties of equivalence.) 

*22. A D (B D C), -i -iA, -i -iB b^^C. 

*23. —i-j(AdB) b -I - 'AD—i—iB. 

*24. -i-i(ADB), -i-i(B DC) b “i ~i(A DC). 

*25. b - ' - 1 (A&B)'~-i-iA&-i-iB; in particular, 

b -i “i (A ~ B) ~ —i ~i (A D B) & —i —i (B D A). 

(Additional results of interest for the intuitionistic system.) 

Proofs. Eight of these have already been established, as follows: 
*1 at §20 (F); *2 at §21 (8'):1; *3at(5'):3; *4at(6'):2; 5* at (7'); 
*6 at ( 8 '): 2; *10a at § 23 (9'): 6 ; and *11 at § 25 (10'). The reader may 
establish the others, using the derived rules of Theorem 2 for the 
propositional calculus (§23). For example: 

*9a. 1 . A D B, A b B b B VC — D-elim., V-introd. 

2. A D B, C b B V C — V-introd. 

3. A D B, A V C b BVC — V-elim., 1 , 2. 

4. AdB |- AVCdBVC — D-introd., 3. 

*12. 1 . A DB, -iB, A b B — D-elim. 

2. AdB, -iB, A b -iB. 

3. AdB, ~iB b'-'A — -i-introd., 1, 2. 

4. ADB b-iBD-iA — D-introd., 3. 

*14. Similarly to * 12 , but now an application of -i-elim. is added at 
Step 3. 

*22. 1 . A D (B D C), B b A D C — D-elim., *3 (or §21 (5'): 2). 

2. A D (B D C), B b -<“iA D-i-iC — *12 twice, 1. 

3. AD(BDC), —i —iA bBD-i-iC — D-elim., D-introd., 2. 

- 4. A D (B D C), -i-iA b -i-iB D-i-iC —*13, *12, 3. 

*23. Taking A D B, A, B as the A, B, C, respectively, in *22: 

1. (A D B) D (A D B), -i -i(A D B), -i -iA bi^B. But: 

2. b (A D B) D (A D B) — *1. Hence: 

3. —i—i(ADB), —i—iA b ~ 1 *B — 1,2. 
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*24. Taking ADB, B DC, AdC for A, B, C, respectively, in *22: 

1. (A D B) => ((B 33 C) D (A D C)), -i-i(AdB), -n(BDC) (- 
-1 -i (A D C). But: 

2. H (A D B) D ((B 3 C) 3 (A D C)) — D-introd., *6. 

*25. Taking A, B, A & B for A, B, C, respectively, in *22, and using 
Axiom Schema 3 (§ 19), —i —iA, —» —iB \- —i —i (A & B). Applying *12 
twice to Axiom Schema 4a, (- -t -i(A & B) D—i—iA; etc. 

Replacement. Let A be a formal expression. Consider another 
formal expression C. It may happen that A occurs as a (consecutive) 
part of C; indeed, this may happen in more than one way. Suppose that 
it does happen, and that, if it happens in more than one way, a particular 
occurrence of A in C has been specified. We now denote C, with a par¬ 
ticular occurrence of A in C specified, by “C A ”. In juxtaposition notation, 
is EAF, where E and F are the parts (possibly empty) which precede 
and follow the specified part A. Now let B be a formal expression. The 
result of replacing the specified part A of C by B is the expression EBF. 
This we denote by "C B ”. 

Contrast this definition of replacement with the definition of sub¬ 
stitution given in § 18. Replacement takes place for a specified occurrence 
of an expression consisting of one or more symbols. Substitution takes 
place for all occurrences of a single symbol, unless there is a distinction 
between ‘free’ and ‘bound’ occurrences, in which case it takes place for all 
free occurrences. (In § 25 we used replacement in all occurrences. That 
is equivalent to replacement, as now defined, applied successively to 
each of the original non-overlapping occurrences of an expression in an 
expression.) 

Example 1. If A is <3? D S, C v is (J? 3 S) & -i (( L? PS ) V -i Ci) 
and B is —ic2? V $, then C B is (<C? D S) & ~i ((~i e7 V S) V -ic~7). 

The foregoing definition of replacement is stated for formal expressions 
in general. For the case that A, C A and B are proposition letter formulas 
(as in Example 1), we have the following situation (as can be demonstrated 
rigorously by applying the analysis of the scopes of operators (§ 17) to 
the definition of proposition letter formula (§25)): The formula C.^ can 
be built up from the specified part A by applications of Clauses 2—5 of 
the definition of proposition letter formula, and C,, can be constructed 
from B by parallel steps. The number of steps in this construction of C v 
from A, after A is given and exclusive of the steps required to build up 
the parts not containing the specified occurrence of A, we call the depth 
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of that occurrence of A in C A . In other words, the depth of the part A in 
C A is the number of operators within the scopes of which it lies. 

Example 1 (continued). The parallel constructions of C A from A 
and of C B from B are as follows, and the depth is 3. 

c3DS -ic2 V S 

{a d s) v -icT v s) v -ic3 

-1 ((eT D S) V -1<3) -1 ((-Ie3 V 2) V -i<07) 

{a D S) & —i ((cxf 3®)V -i<v?) (c3 D 2) & -i((-.o2 V S) V -i<3) 

Theorem 6. If A, B, C A awi C B are proposition letter formulas related 
as in the foregoing definition of replacement, then A ~ B h C A ~C B . 
(Replacement theorem.) 

Proof, by induction on the depth of A in C A , taking the A and B fixed 
for the induction. The induction proposition is that what is stated in the 
theorem is true, with the fixed A and B, for every C A in which the specified 
occurrence of A is at depth d. Basis : A is at depth 0 in C A '. Then C A is A, 
C B is B, and the conclusion of the theorem is simply A ~ B |- A ~ B, 
which holds as a general property of h Induction step: A is at depth 
^+1 in C A . As hypothesis of the induction, A~B M A ~ M B for 
any proposition letter formula M A in which the specified occurrence 
of A lies at depth d. Now C A must have one of the seven forms M A D N, 
NdM v M a & N, N & M a , M a V N, N V M a , -iM a , where M A and N are 
proposition letter formulas, and A is at depth d in M A . By the hypothesis 
of the induction, A~B [- M A ~ M B . Furthermore, by the appropriate 
one of the following lemmas (taking M A as the A, M B as the B, and N as 
the C of the lemma), M A ~M B \- C A ~C B . Therefore A ~ B b C A ~ C B . 

Lemmas for replacement. If A, B and C are formulas : 

*26. A — B hADC^BDC. *27. A ~ B |-C=)A~C3B. 

*28a. A ~ B hA&C-B&C. *28b. A ~ B hC&A~C&B. 

*29a. A ~ B [-A V C~B V C. *29b. A ~ B bCVA~CVB. 

*30. A ~ B b ~' A ~ -i B. 


Proofs. 

*26.' 1. A ~ B b B Z> A — &-elim. (*17b). 

2. A ~ B |- (A 3 C) 3 (B D C) — *6, 1. 

3. A ~ B [- (B 3 C) ID (A D C) — similarly, using *17a and *6. 

4. A~B [-AdC~BdC — &-introd. (*16), 2, 3. 

*27. Similarly, using *17a and *7, then * 17b and *7. 



§26 


EQUIVALENCE, REPLACEMENT 


117 


Example 1 (concluded). Let ~ be written between each of the 
four pairs of formulas in the parallel constructions of C A and C B . The 
second of the resulting four formulas is deducible from the first by *29a, 
the third from the second by *30, and the fourth from the third by *28b. 
Combining these deductions consecutively, we have the deduction of 
C A -—' C B from A ~ B which is given by the method of proof of Theorem 6. 

Theorem 6 has been stated in terms of proposition letter formulas. 
But by the substitution rule (Theorem 3 § 25), we can apply it for other 
senses of formula, provided that the A, B, C A of the application can be 
obtained from proposition letter formulas by substituting formulas 
simultaneously for the proposition letters. Hence, or directly by the 
method of the above proof of Theorem 6: 

Theorem 6 (second version). If A and B are formulas, C A is a 
formula constructed from a specified occurrence of A using only the operators 
D, &, V, “i, and C B results from C A by replacing this occurrence of A by B, 
then A ~ B [- C B . 

Example 2. Let x be a variable, and A, B and C(x) be formulas. 
By the theorem, A ~ B |- A V Vx(A D C(x)) ~BV Vx(A D C(x)). (This 
can be considered as coming from <37 ~ 2 hc37VC^2VC by sub¬ 
stituting A, B, Vx(A D C(x)) for <37, 2, C, respectively.) But our present 
means are inadequate for deducing A V Vx(A d C(x)) ~A V Vx(B D C(x)) 
from A ~ B. In this the occurrence of A to be replaced is within the 
part Vx(A D C(x)), and so the C A cannot be built up from the occurrence 
of A by using only the operators D, &, V, -i. 

Corollary. Under the conditions of the theorem {in either version), 
A ~ B, C A |- C B . (Replacement property of equivalence.) 

From the theorem by *18a. (Conversely, the theorem is obtained from 
the corollary and *19 by taking Ca ~ Ca as the Ca of the corollary. The 
theorem includes the lemmas, as the cases when the depth is 1.) 

Our results have been developed to give replacement of a single oc¬ 
currence of A at a time. By iterated applications, we can then replace 
any set of occurrences. 

Chains of equivalences. We can now present demonstrations of 
equivalence between proposition letter formulas in the following ab¬ 
breviated way. Let us write 

b C 0 ' ’ C x - ' ... ~ C„_ x ~ C„, 
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where for each i [i — 1, ..., «) either: 

(a) C< is the same formula as C<_ x ; or 
(bj) b C,., ~ C, or (b 2 ) b Q ~C,_ 1 ; or 

(c) C i comes from C,.., by replacing one or more occurrences of A,- 
by B, where (1) b A* B< or (2) b B, ~ A,-. 

Then we can regard “C 0 ~ C, ~ ... ~ C n _ x ~ C„” as an abbreviation 
for (... ((C 0 - C x ) & (C x ~ C 2 )) & ... & (C n _ 2 ~ C„_,)) & (C n _ x ~ C n ); 
and we can understand that b C, ~ C k for all pairs j, k (j, k = 0, ..., n). 
For we have for each Neither b C^^Cjor b C t -~ Cj.j.by *19inCase(a), 
immediately in Case (b), and by applications of Theorem 6 in Case (c). 
Then we have b C* ^ C fc for all other pairs j, k by * 19, *20 and *21. The 
method applies likewise when we have any list T of assumption formulas 
written before the symbol “ b” throughout. We may insert explanatory 
remarks in brackets between links of the chain. (For an example see the 
proof of *57 in § 27.) 

The chain method will apply also when we have instead of ~ some 
other relational symbol for which we have established the corresponding 
reflexive, symmetric, transitive and replacement properties. Furthermore 
in the absence of some of these properties (except transitivity) it can be 
modified to apply, as follows. When symmetry is absent, omit (b 2 ) and 
(c) (2), and require that j < k. When reflexiveness is also absent, omit 
also (a), and require that j < k. When replaceability is absent, omit (c). 
(Examples from Chapter VIII : with all properties present, =; lacking 
symmetry and replaceability, < ; lacking also reflexiveness, <.) 


§ 27. Equivalences, duality. 

formulas: 

Theorem 7. If A, B and C are 

*31. 

b (A&B)&C~A&(B&C). *32. b (A V B) V C ~ A V (B VC). 

*33. 

b A & B ~ B & A. 

*34. b AVB~BVA. 

*35. 

b A & (B V C) ~ 

*36. bAV(B&C)~ 


(A & B) V (A & C). 

(A V B) & (A V C). 

*37. 

r A 6c A ~ A. 

*38. b AVA~A. 

*39. 

b A&(AVB)~A. 

*40. b A V (A & B) ~ A. 


(Associative, commutative, 

distributive, idempotent and 


elimination laws.) 

*41. 

'A b A D B ~ B. 

*42. B b A OB~B. 

*43. 

~i A b A D B ~ -1 A. 

*44. -iB t ADB ~ -iA. 

*45. 

B u A & B ~ A. 

*46. B b A V B ~ B. 

*47. 

B - A & B -—■ B. 

*48. -iB j- A V B ~ A. 


fSpecial caws of implication, 

, conjunction and disjunction.) 
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*49°. I- -i —i A ~ A. 

*50. b“»(A&-iA). *51°. bAV-iA. 

(Law of double negation, denial of contradiction, law of 
the excluded middle.) 

*52°. b A&(BV-.B) ~ A. *53. b AV(B&nB)~ A. 

*54. b A&B&-iB~B&-«B. *55°. b AVBViB~BVnB. 

(For simplifying a disjunction of conjunctions, or a conjunction 

of disjunctions.) 

*56°. b AVB~-i(-iA&-iB). *57°. b A&B ~-i(-iA V-iB). 

*58°. b A3 B~-i(A&-iB). *59°. b A DB- iAVB. 

*60°. b A & B i(A3-iB). *61°. b A V B ~ -iA 3 B. 

(Each two of 3, &, V in terms of the other and -i.) 

*62°. b ~i(A&B) ~-iA V-iB. *63. b “•(A V B) -iA & -iB. 

(Transfer of -i across & and V (De Morgan’s laws, 1847 ).) 

*49a. b A D —1 —iA. *49b. b in nA~ nA. 

*49c. b AV-iAd(-i*iAdA); hence b AV-iA 3 (-1 - 1 A ~ A). 

*50a. b - '(A ~-1 A). *5la. b “i ~i (A V —> A). 

*56a. b A V B 3 —1 (— 1 A & -iB). *51b. b “i "i (“i ~i A 3 A). 

*56b. b iAVB D-i(A&-iB). *57a. bA&B3-i(-iAV-iB). 

*58a. b (A 3 B) 3 -i (A & -iB). *57b. b A&- 1 B 3 -i(-»A V B). 

*58b-d. b AD-iB~-i(A&B)~nAD-iB~-in(-iAV —i B). 
*58e,f. “iB 3 B b “< A 3 B > —'ADB~*i(A & ~iB). 

*58g. b (- 1 - 1 ADB) 3- < (A&-.B).*59a. b “»A V B 3 (A 3 B). 

*60a. b A&B 3-i(A 3-iB). *59b. b (A 3 B) D -1 -i(-iA V B). 

*60b. b A&iB 3 -i(A 3 B). *59c. b ( 1 A 3 B) D- 1-1 (A VB). 

*60c. b -i-«A&B3-i(A3-iB). * 6 la. b AVB 3 (-.A 3 B). 

*60d-f. b -i“i 1 A & -iB ~ i(A 3 B) ~ “i("iA VB) ■—• —1 -i (A & “iB). 
*60g-i. b n(A3B)'— , i(A&iB)~A3mB~-TnA3-iiB. 

*62a. b - 1 AV- 1 B 3 -i(A&B). *61b. hi(AVB) -.(-iA 3 B). 

(Additional results of interest for the intuitionistic system.) 

Proofs for the classical system, excepting *32, *34, *36, *38, *40, 
*53, *55. A little work may be saved if one chooses by postponing proof 
of these seven until duality (Corollary Theorem 8 ) is available. 
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*35. 1. A, B h A&B [- (A & B) V (A & C) — &-introd., V-introd. 

2. A, C 1- A & C 1- (A & B) V (A & C) — &-introd., V-introd. 

3. A, B VC h (A&B) V(A&C) — V-elim., 1, 2. 

4. A & (B V C) b (A & B) V (A & C) — &-elim., 3. 

5. |- A & (B V C) D (A & B) V (A & C) — D-introd., 4. 

6 . A&B b A — &-elim. 

7. A&B 1-B B VC — &-elim., V-introd. 

8 . A&B b A & (B V C) — &-introd., 6, 7. 

9. A & C b A & (B V C) — similarly. 

10 . (A & B) V (A & C) b A & (B V C) — V-elim., 8, 9. 

11 . b (A & B) V (A & C) D A & (B V C) — D-introd., 10. 

12 . b A & (B V C) ~ (A & B) V (A & C) — &-introd. (*16), 5, 11 . 

*49. 1. b "i~iA D A — -i-elim., D-introd. (or Axiom Schema 8). 

2. A, -iA b A. 

3. A, -i A b iA. 

4. A b “i“iA — -i-introd., 2, 3; etc. 

*51. 1. -i(AViA), A b AV-iA — V-introd. 

2. -i(AV-iA), A b i(AV-iA). 

3. -i(AV-iA) b “iA — -i-introd., 1, 2. 

4. -i(AV~iA) b “>“>A — similarly. 

5. b - t"'(AV _ iA) — -i-introd., 3, 4. 

6. b AV-iA — i-elim., 5. 

Remark 1. Thus in the formal system without Axiom Schema 8, 
-,-iBDBbAV-iA where B is AV -i A. Conversely, in the intuitionistic 
system, AV-iA b “iiA D A, thus: 

1. A b "i "i A DA — *11. 

2. -i A b “i “i A DA — *10b. 

3 . A V “iA b ”i A DA — V-elim., 1, 2. 

Thus either of nA D A or AV-iA can be chosen as the one non- 
intuitionistic postulate of the classical system. 

*52. By *45 and *51. 

*54. Similarly, by *47 and *50. Parentheses have been omitted in the 
result, since by *31 it is immaterial which way the association is taken. 

*56. 1. A, -iA & -iB b A. 

2. A, -iA& -iB b ~>A — &-elim. 

3 . A b - i( - 'A&-iB) — -i-introd., 1, 2. 

4 . B b "i(~>A& -iB) — similarly. 
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5. h AVB => -i(-.A&-iB) — V-elim., 3, 4, D-introd. 

6 . -i (A V B), A h AVB - V-introd. 

7. -I(A V B), A b -i(A VB). 

8 . -i(AVB) b “»A — -i-introd., 6, 7. 

9. -i (A V B) b ~>B — similarly. 

10. b -l (AVB)D-iA&-iB — &-introd. 8, 9, D-introd. 

11. b “i(“iA&nB) D AVB — contraposition (*14), 10. 

*57. Presented as a chain of equivalences: b i(iAV-iB) 

—i (—i -l A & —i —i B) [*56] ~ -i A & -i —i B [*49] ~ A & B [*49]. 

*59-63. Set these up by the chain method. 

Proofs for the intuitionistic system. *49c. By Remark 1; by *49a, 
*16 and *17a, b “> “iA 3 A~(-nA~A). *51a. By the proof of *51 
omitting Step 6. *5lb. Apply *12 twice to *49c, and use *5la. *58d. 
Use *63. *60f. Use *25. 

To interchange two expressions A and B throughout a third expression 
C is to replace in C, simultaneously, all occurrences of A by B and all 
occurrences of B by A (examples will follow). 

Theorem 8 °. Let D be a proposition letter formula constructed from the 
distinct proposition letters P 1( ..., P m and their negations -iP 1( ..., -i P m 
using only the operators &, V. Then a formula Dt equivalent to the negation 
iD o/ D is obtained by the interchange throughout Do/& with V and of 
each letter with its negation. 

In other words, if D be such a proposition letter formula, and Dt be the 
result of the described interchange performed on D: b “iD ~ D+. 

Example 1°. Taking -i<37 & (-i25 VB) as the D, -iD is equivalent 
to a V (S & -iB). 

Proof amounts essentially to this: the -i of -iD can be transferred 
progressively to the interior by applications of *62 and *63, and any re¬ 
sulting double negations then discharged by applications of *49, in doing 
which D is transformed into Dt (details to follow). 

Example 1 (concluded). 

b ->( -i<37& ( -iBV B)) ~ 

—i -i <37 V -i ( —i S V B) 

-i -i V (-1—1 B & —iB) 

<37 V ( £&-i2). 

To give the proof more explicitly, we take as the induction number 
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(for a course-of-values induction) the number of occurrences of & and 
V in D; call it the grade of D. 

Basis : D is of grade 0. Then for some proposition letter P, D is P or D is 
-i P. Case 1: D is P. Then -i D ~ -i P [case hypothesis] ~ Pt [def¬ 
inition of t] ^ D+ [case hypothesis]. Case 2: D is -iP. Similarly, using 
*49. Induction step: D is of grade 1. Then for some proposition letter 
formulas A and B of the type under consideration with grades <; g, either 
D is A & B or D is A V B. Case 1: D is A & B. Then \- -i D ~ -i (A & B) 
[case hypothesis] ~ ~i iAV— iB [*62] >—• A + V Bt [hypothesis of the in¬ 
duction] ~ (A &B)t [definition of t] ^ Dt [casehypothesis]. Case 2: D is 
A V B. Similarly. 

Example 2°. By the substitution rule (Theorem 3 § 25), substituting 
B, -ic9? V C for <C7, B in the result of Example 1, 

h -i(-i8&(-i[n^vqv[-ic?vq))~sv([-i^vq&-i[-i^vq). 
In fact, for any formulas A and B, 

|- n(iA & (iBVB)) ~ A V (B & -iB). 

As this example illustrates, the theorem can also be stated using any 
formulas A x , ...,A m in place of the proposition letters P 1( ...,P m , 
provided the A lt ..., A m retain their identity throughout the construction 
of D, and are held intact in the interchange operation. (Second version of 
Theorem 8.) 

Corollary 0 . An equivalence between two letter formulas E and F of 
the type described in the theorem is preserved under the interchange throughout 
E and F of & with V. 

In other words, if E and F be two such proposition letter formulas, and E' 
and F' be the results of the described interchange performed on E and F, 
respectively: If [- E ~F, then p E' ~ F'. (Principle of duality.) 

Example 3°. By *52, p <3 & (B V -i B) ~ <3. Hence (taking 

& (B V -i B) as the E and <3 as the F) h^V(S&-iB)~ a. 

Proof. By hypothesis, h E~F. Let us substitute for each propo¬ 
sition letter P within E and F the negation -i P of that letter, indicating 
this substitution operation by “*”. By the substitution rule (Theorem 3 
§ 25), then j- E* ~ F*. Next let us replace within E* and F* each doubly 
negated letter -i -iP by the simple letter P, indicating this operation by 
“t". By the law of double negation (*49) and the replacement property of 
equivalence (Corollary Theorem 6), p E*t ~ F*t. The effect of these two 
operations is to interchange the proposition letters with their negations 
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in the given equivalence. Now by Theorem 6 (or *30), b -iE*t ~ -iF*t. 
Evaluating the negations by Theorem 8 , b E*tt ^ F*tt. These last 
two steps effect the interchange of & with V, and interchange the 
proposition letters with their negations a second time to restore them 
to their original condition. What we now have is therefore b E' ~ F\ 

Example 3 (concluded). 


b E F. 

b 

ZA & ( BV-i S) ~ 

<3. 

b E* ~F*. 

b 

->ZA &(-iSV -1 iB) ~ 

-i 

b E*t ~F*t. 

b 

-\ZA & (— 1 S V 8 ) ~ 


b-iE*t~-iF*t. 

b 

—1 (—1 ZA & (-18 V 8 )) ~ 

—| “1 cX 

bE*tt~F*tt, i.e. b E'~F\ 

b 

ZAV( 8 & -iB) ~ 

cX 


Example 4°. Substituting any formulas A, B for ZA, 8 in the result 
of Example 3 (by Theorem 3 § 25), (- A V (B & -i B) ~ A. This is *53. 

Similarly *32, *34, *36, *38, *40, *55 follow from *31, *33, *35, *37, 
*39, *54, respectively, by duality (Corollary Theorem 8 ), when A, B, C 
are simple proposition letters; and thence by the substitution rule (Theo¬ 
rem 3), when A, B, C are any formulas. 

Example 5°. By duality (Corollary Theorem 8 ): 

(a) If p ZA V 8 ~ ZA, then 
But we cannot infer the following: 

(b) “For any formulas A and B, if h A V B ~ A, then f-A&B~ A.” 
(Indeed, taking ZA, 8 & -iB for the A, B in (b), “ b A V B ~ A” becomes 
“ b ZA V (8 & -iB) ~ ZA”, which is true by *53; while " b A & B ~ A” 
becomes “b ZA & 8 & -iB ~ ZA” , which is false as will be shown in 
Example 4 § 28.) Explain. (How does the present situation differ from 
Examples 2 and 4 ?) — To state a second version of the corollary, we must 
require that the A x ,.. ., A m be distinct prime formulas (cf. Theorems 3 
and 4). — Since (b) is false, by Theorem 3 so is: 

(c) “ZAV 8 ~ ZA b ZA &8 ~ ZA." 

Thus duality holds only as a subsidiary deduction rule, not as a direct rule. 

The recognition of duality in logic goes back to Schroder 1877 . 

In applying duality to a formula that has been abbreviated by omitting 
parentheses under the convention of § 17, which ranks & ahead of V, care 
must be taken to show the scopes of the operators without change in the 
result. (Hence we usually prefer in this connection not to apply the con¬ 
vention between & and V.) 

As an exercise, the reader may verify the following addendum to the 
corollary, by reexamining the proof. 
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Corollary (second part) 0 . Also: If j- E D F, then |- F' D E'. (Dual¬ 
converse relationship.) 

Example 6 °. The axiom has as dual-converse the axiom 

d D H V $. The provable formula eT & S D d V S is its own dual- 
converse. 

§ 28. Valuation, consistency. Since setting up the formal system, 
or in this chapter a subsystem, our metamathematical investigations 
have been devoted chiefly to establishing the provability of certain for¬ 
mulas, and the deducibility of certain formulas from other formulas, i.e. 
to developing logic and mathematics within the formal system. This is 
a necessary part of our program, and contributes to showing that the 
formal system does constitute a formalization of a certain part of mathe¬ 
matics. 

But also in metamathematics questions are asked which relate to a 
formal system as a whole. One of these is the question of the ‘consistency’ 
of the system, which is fundamental in Hilbert’s program (§ 14). 

The propositional calculus (and generally, any formal system having 
the symbol -i for negation) is said to be ( simply) consistent, if for no 
formula A are both A and -iA provable in the system; and to be [simply) 
inconsistent in the contrary case that for some formula A, both |- A and 
I- -i A. 

This is a strictly metamathematical definition. It refers only to the 
formal symbol -i, and to the definitions of formula and provable formula. 
It thus becomes an exact mathematical problem, which we can consider 
in metamathematics, to prove the consistency of a given formal system. 

The definition and problem of consistency take on significance from 
outside the metamathematics, under the interpretation of the formal 
system as a formalization of an informal theory, with the symbol -\ 
expressing negation. The propositions expressed by two number-theoretic 
formulas A and -iA, if A does not contain free variables (or the propo¬ 
sitions expressed for each particular set of values of the variables, if A does 
contain free variables), taken together constitute a contradiction. Like¬ 
wise, in the case of proposition letter formulas, on interpreting the propo¬ 
sition letters by any particular propositions. A metamathematical proof 
of the consistency of the formal system would hence afford security 
against a contradiction’s arising in the informal theory. 

For the propositional calculus (and generally, for any formal system 
which has &-elimination and weak -i -elimination as postulated or derived 
rules), the above definition is equivalent to the following. The system is 
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(simply) consistent, if there is some unprovable formula; (simply) in¬ 
consistent, if every formula is provable. For if both \- A and p -iA, 
then by use of weak -i-elimination (§ 23), b B for every formula B. In the 
case of consistency, A & -i A is an example of an unprovable formula, 
since otherwise by &-elimination both |- A and |- —*A. 

The definition of consistency (in the first form) and the definition of 
provable formula (say in the first form, § 19) suggest a plan of attack on 
the problem of proving consistency (not the only possible plan). Suppose 
we can find a metamathematical property of formulas such that (a) the 
axioms have the property, (b) if the premises for an application of a rule 
of inference have the property, so does the conclusion, and (c) two formu¬ 
las of the forms A and -i A cannot both have the property. Then using 
(a) and (b) every provable formula would have the property, and by (c) the 
system would be consistent. In this section we shall give a metamathe¬ 
matical proof of consistency for the propositional calculus following this 
plan. 

The property of formulas which will be used is suggested by the logical 
interpretation of the propositional calculus. We conceive of each propo¬ 
sition letter as a variable whose values are propositions, and we conceive 
of these propositions as being each either true or false. The operators of 
the calculus D, &, V, -i form from these propositions other propositions 
whose truth or falsity will depend only on the truth or falsity of the com¬ 
ponent propositions, according to tables to be given presently. (Hence the 
operators of the calculus are sometimes called ‘truth-value functions of 
propositions’.) Then it will appear that the provable proposition letter 
formulas all have the property that they are identically true, in the sense 
that they represent true propositions for all possible permutations 
of true and false propositions as values of the proposition letters contained 
in them. 

To use this idea for the purpose of a metamathematical consistency 
proof for the calculus, it is necessary to avoid the reference to ‘propo¬ 
sitions’, ‘truth’ and ‘falsity’, which have connotations that are extraneous 
to the metamathematics. This we can do, since nothing essential in the 
argument outlined above depends on the values of the proposition letters 
being propositions, or on the nature of truth and falsity, except that true 
and false propositions are distinct from each other. 

The purely mathematical character of what we shall do may be em¬ 
phasized by an analogy to the elementary school arithmetic of positive 
integers. While the numbers 1, 2, 3, ... in that arithmetic were intended 
to have a meaning for counting and measurement, so far as the addition 
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and multiplication tables were concerned they could be any enumeration 
of distinct objects. From this standpoint, the arithmetic deals with 
operations, i.e. functions, + and *, over a domain of objects {1, 2, 3, .. 
and depends only on the possibility of recognizing and distinguishing 
between those objects, and not on their intrinsic nature. 

We shall now set up an arithmetic in like sense for a domain of only 
two objects, with four functions D, &, V, -i. This describes succinctly 
what we shall do. Since for metamathematics, D, &, V, -i are meaningless 
given objects, a more precise statement of what we shall do is the 
following. We introduce a metamathematical computation process (called 
a valuation procedure), by which a function in the arithmetic (or a table 
for such a function, called a truth table) is correlated to each of the symbols 
D, &, V, -i, and thence to each proposition letter formula. Then we 
study metamathematical properties of proposition letter formulas defined 
in terms of the correlated functions (or tables). 

Since only the distinctness of the two objects (truth values) is required for 
the abstract arithmetic, it is immaterial what they are called. We might 
designate them as “0” and “1”, or “ + ” and or “f” and "4-”, or 

"t” and "f”, etc. We choose the last pair of symbols, which suggest re¬ 
spectively the notions 'true’ and ‘false’ of the logical interpretation. 

We begin by considering the proposition letters as variables ranging 
over the domain {t, f}. 

We then consider the operators of the calculus as functions over this 
domain, defined by the following tables, analogous to the addition and 
multiplication tables in the arithmetic of positive integers. From the table, 
to any given value(s) of the independent variable(s), the corresponding 
value of the function can be read. 


c3DS J?&8 c 2VS -le* 


s 

t 

f 

8 

t 

f 

8 

t 

f 



eT t 

t 

f 

a t 

t 

f 

a t 

t 

t 

cT t 

f 

f 

t | 

t 

f. 

f 

f 

f 

t 

f 

f 

t 


(The tables for -i and V are the same as were given in § 10 Example 4 
for ' and *, when the two objects are written “0” and “1”.) 

Then each proposition letter formula A in a given list P 1( ..., P m of 
distinct proposition letters represents a function of those letters regarded 
as independent variables over the domain {t, f}. To each w-tuple of values 
of the letters, the corresponding value of the function can be computed 
by a series of applications of the fundamental tables. 
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Example 1 . The letter formula <3 ~B, i.e. (d "D S) & (S D <CT) 
(§ 26), represents the function which has the following table. 

a 


s 

t 

f 

<3 t 

t 

f 

f 

f 

t 


The computation of the value entered in the upper right square is as 
follows. 

(c 3 D £) & (B Z> a) 

(t => f)&(f 3 t) 
f & t 

f 


The square arrangement of the tables is special to two variables (where 
it helps to suggest properties of the functions). For m variables P lP ...', P m 
generally, we can arrange the 2 m possible w-tuples of arguments vertically 
one below another in some fixed order, and write the corresponding func¬ 
tion values in a value column opposite these. 


Example 2. 


a 

B 

C 

-i[c2VBD(MC)V -»<3] 

t 

t 

t 

f 

t 

t 

f 

t 

t 

f 

t 

t 

t 

f 

f 

t 

f 

t 

t 

f 

f 

t 

f 

f 

f 

f 

t 

f 

f 

f 

f 

f 


A proposition letter formula E in the distinct proposition letters 
P x , ..., P m is said to be identically true, if the value column of its table 
contains only t’s; identically false, if only f’s. Two proposition letter 
formulas E and F in P 1( ..., P m are said to be identically equal, if their 
tables have the same value column. (In other words, an identically true 
E represents the constant function t, an identically false E the constant 
function f, and identically equal E and F the same function.) 

These definitions so stated apply to E (to E and F) considered as 
proposition letter formula(s) in a specified list P 1( ...,P m of distinct 
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proposition letters, not all of which (according to § 25) need occur in the 
formula(s). We may suppose first that P 1( ..., P m is the minimal eligible 
list, i.e. the list comprising exactly those distinct proposition letters 
which occur in E (in E or in F). Then if we add to the list other letters, 
each row in the table(s) for the formula(s) will simply be split into a 
number of rows (2 k of them if k letters are added) showing the possible 
assignments of values to these additional letters, without altering the 
computation process or value entry. Hence if we have identical truth or 
identical falsity (identical equality) with respect to the minimal list, we 
shall have it also with respect to any other list, and conversely. Therefore 
the reference to the list may be omitted. 

Theorem 9. A necessary condition that a proposition letter formula E 
be provable (or deducible from identically true formulas T) in the propositional 
calculus is that it be identically true ; i.e. if 1- E, then E is identically true. 

Proof is by course-of-values induction on the length of the given proof 
of E, using the two following lemmas. 

Lemma 12 a. A proposition letter formula which is an axiom is iden¬ 
tically true. 

Proof. For each of the ten axiom schemata of the propositional 
calculus (Postulates la, lb, 3 — 8 § 19, or intuitionistically 8 1 § 23 instead 
of 8), we can easily verify the following fact by computation: The table 
which is obtained by assigning the values t and f in all possible ways 
directly to the parts A, B, C which appear in the schema contains only 
t’s in the value column. This amounts to verifying the lemma treating 
the A, B, C of the schemata as simple proposition letters <^?, S, C. 

The truth of the lemma follows from this. For consider any axiom 
which is a proposition letter formula. This comes from one of the schemata 
by taking as the A, B, C certain proposition letter formulas in say 
P lf ... P m as joint list of distinct proposition letters. Now no matter 
what m-tuple of t’s and f’s is assigned as values to P 1( ... P m , the triple 
of values for A, B, C to which it leads must in turn (as already verified) 
lead to the value t for the whole axiom. 

Lemma 12b. If the premises for an application of the rule of inference 
are identically true proposition letter formulas, so is the conclusion. 

Proof. ’ By inspection of the following table, we see that the only 
pair of values of A, B which gives the value t to both premises A and A D B 
for the rule of inference of the calculus (Postulate 2) is the pair t, t; and 
this pair does give the value t to the conclusion B. 
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A 

B 

A 

A D B 

B 

t 

t 

t 

t 

t 

t 

f 

t 

f 

f 

f 

t 

f 

t 

t 

f 

f 

f 

t 

f 


Hence, if A and B are proposition letter formulas in P^ ..., P m , any 
m-tuple of values of P^ .... P m which gives both premises the value t 
must give this value also to the conclusion. By hypothesis, the premises 
have the value t for all ra-tuples of values of P x , ..., P m . Therefore the 
conclusion has also. 

Example 3. The formula -i[U?VS D (B & C) V —\£ f \ is not prov¬ 
able in the propositional calculus, because there are f’s in the value column 
of its table (Example 2). Likewise, CZ V & S) is not provable, be¬ 
cause it takes the value f when TZ, S take the values f, f. 

Corollary 1. A necessary condition that two proposition letter formulas 
E and F be equivalent is that E and F be identically equal) i.e. if h E ~ F, 
then E and F are identically equal. 

Proof. If E and F are equivalent, then by definition of equivalence, 
E ~ F is provable. Hence by the theorem, E ~ F is identically true. Re¬ 
ferring to the table for ~ (Example 1), we see that E ~F can only 
receive the value t when E and F receive either both the value t or both 
the value f, i.e. when they receive the same value. Since E ~ F is iden¬ 
tically true, i.e. does always receive the value t, E and F do always receive 
the same value, i.e. they are identically equal. 

Example 4. 3 and TZ are not equivalent, because they 

take different values (namely, f and t, respectively) when e.g. TZ, S are 
given the values t, t 

Corollary 2. The propositional calculus is ( simply ) consistent) i.e. 
for no formida A, both h A and b —iA. 

Proof. Using the second version of the definition of simple con¬ 
sistency (above), this is already proved by Example 3. 

To give the proof directly from the original definition, suppose one of 
A and -i A is provable. Then by the theorem it is identically true; then 
using the table for ~i, the other is identically false, hence not identically 
true, and hence by the theorem unprovable. 
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This establishes the consistency for the propositional calculus in terms 
of proposition letter formulas. If A* and -iA* be provable formulas in 
the calculus for another notion of formula, then by the converse of the 
substitution rule (Theorem 4 § 25), there would be provable proposition 
letter formulas A and -iA. Thus the consistency extends to the other 
senses of formula. 

This consistency proof, of course, does not hold good for the addition of 
another group of postulates, even should that group by itself be con¬ 
sistent. 

The consistency proof for the propositional calculus was first given by 
Post 1921 . (Cf. also Lukasiewicz 1925 , Hilbert-Ackermann 1928 .) 

The arithmetic of a domain of two objects, used in this section to give 
the consistency proof, may be thought of as an interpretation of the 
calculus in the following sense. The valuation tables provide arithmetical 
significances for the operators of the calculus. The proposition letters are 
considered as independent variables ranging over the domain {t, f} of the 
arithmetic. Each proposition letter formula is interpreted as expressing 
the proposition that its value is t for all choices of the values of its in¬ 
dependent variables. By Theorem 9, only formulas which are true under 
this interpretation are provable. For the proposition expressed by a 
formula under this interpretation is equivalent to the formula’s having 
the metamathematical property of identical truth, as defined above 
in terms of a computation procedure using t and f. 

On the other hand, in the (usual) logical interpretation , the proposition 
letters are considered as independent variables ranging over some domain 
of propositions. A proposition letter formula then expresses the general 
proposition that all the particular propositions, expressed by it for dif¬ 
ferent choices of propositions from that domain as values of its independ¬ 
ent variables, are true. The logical interpretation is related to the 
arithmetical interpretation by putting the particular propositions (from 
the domain considered) into many-one correspondence with the two 
objects t, f, those propositions which correspond to t being true and those 
which correspond to f being false. The proposition expressed by a formula 
under this interpretation is not equivalent to a metamathematically 
definable property of the formula, except for specially restricted domains 
of the propositions allowed as values of the proposition letters. 

§ 29. Completeness, normal form. Another problem which we 
may be able to treat in metamathematics is that of the ‘completeness’ of a 
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given formal system. For example, we have listed eleven postulates for 
the propositional calculus (§ 19). Can we give a reason why we stop with 
just these? Might we with advantage attempt to discover others which 
could be added to the list to give more provable formulas ? To be able to 
answer these questions, we must first provide some criterion as to what 
we want to be able to prove in the system. Different notions of complete¬ 
ness will result according to the criterion chosen. 

We may give the criterion a positive form, and say that the system is 
complete, if its postulate list already provides all we need for some pur¬ 
pose. For example, suppose that some property has been defined for for¬ 
mulas of the system; or alternatively, that an interpretation has been 
given to the formulas of the system, in which case the property is that the 
formulas express true propositions under the interpretation. Relative to 
such a property or interpretation, definitions of both consistency and 
completeness can be given as follows. 

The system is consistent with respect to the property (or interpretation), 
if only formulas which have the property (or express true propositions 
under the interpretation) are provable. The system is complete with respect 
to the property (or interpretation), if all formulas which have the property 
(or express true propositions under the interpretation) are provable. 

Unlike the notion of simple consistency given in the preceding section, 
these notions of consistency and completeness relative to a property or 
interpretation may not always belong to metamathematics. Whether or 
not they do we shall have to consider from case to case, according to 
whether the property (or interpretation) is one which can be formulated 
within metamathematics. 

For the propositional calculus we have the property of identical truth 
(or if we prefer, the interpretation of the calculus as an arithmetic of a 
domain of two objects), which can be formulated in metamathematics. 
Theorem 9 is thus a metamathematical consistency theorem for a certain 
property of letter formulas (or interpretation of the calculus). 

To recapitulate the idea of completeness relative to an interpretation: 
A system is complete under a given interpretation, if the deductive 
postulates (or transformation rules) enable us to prove in the system all 
the true propositions which its formation rules enable us to express in the 
system. 

We are led to other formulations of completeness, if we give the criterion 
for what formulas should be provable a negative form, and say that the 
system is complete, if the postulates provide all that we can afford to 
have lest some undesirable effect ensue. An effect which comes to mind is 
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simple inconsistency. The completeness notions obtained in this way will 
always be metamathematical, if the effect to be avoided is metamathe- 
matically describable (as in particular simple inconsistency is). Exact 
formulations will be given for particular formal systems as we come to 
them. 

Notice that a consistency theorem will always be a theorem to the 
effect that at most such and such formulas are provable; and a com¬ 
pleteness theorem one to the effect that at least such and such formulas 
are provable. 

From this general introduction to the problem of completeness, we 
turn now to consider it for the propositional calculus. We shall show first 
(Theorem 10) that the calculus is complete with respect to the property 
of identical truth. 

Theorem 10° and Corollary 1°. The conditions of Theorem 9 and 
Corollary 1 are sufficient (as well as necessary); i.e. if E is identically true, 
then (- E, and if E and F are identically equal, then [- E ~F. 

The proof will be based on two lemmas. Let P x , ..., P m be distinct 
proposition letters. Given an m-tuple of t’s and f’s as values of P x , .... P„„ 
by the corresponding letter m-tufle we shall mean the sequence Q x , ..., Q m 
of letters and negated letters where, for each / (/ = 1, ..., m), Q, is P, 
or -i P 3 according as the given value of P, is t or f. 

Example 1. Let d, B, C take the values t, f, t, respectively. The 
corresponding letter m-tuple is Cl, -i B,. C. 

Lemma 13. Let Hi be a proposition letter formula in the distinct propo¬ 
sition letters P 1( ...» P m ; let an m-tuple of t’s and f’s be given as values of 
P x , ..P m ; and let Q 1( .,Q m be the corresponding letter m-tuple. Then 

Qi.Qm b E or Q x , ..., Q m F -iE, according as for the given m-tuple 

of values E takes the value t or the value f. 

Example 2. Corresponding to the table in Example 2 § 28 for 


[Cl V B D (B 

&C) V 

-i Cl], 

we now have 

eight 

deductions: 

a, 

B, 

c 

b 

“i “i [Cl V B 

D 

(B 

&C) 

V -1<3]. 

a, 

B, 

—iC 

b 

-i [dVB 

D 

(B 

&C) 

V-Ic3]. 

Cl, 

-iB, 

c 

b 

-.[<3 VB 

D 

(B 

&C) 

M-yd}. 

a, 

-iB, 

—iC 

b 

-i [<3 VB 

D 

(B 

&C) 

V -yd}. 

nc3, 

B, 

c 

b 

—[ [Cl V B 

D 

(B 

&C) 

M -yd}. 

“lev?, 

B, 

—iC 

b 

-i “i [Cl V B 

D 

(S 

&C) 

V-id]. 


iB, 

c 

b 

-i -i [Cl V B 

D 

(B 

8c C) 

V -id}. 

— 

-iB, 

— i c 

b 

-i-i [Cl V B 

D 

(B 

8c C) 

V-yd}. 
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Proof of Lemma 13, by course-of-values induction on the number of 
(occurrences of) logical symbols in E; call this number the degree. 

Basis : E is of degree 0. Then E is for some j. In the given m-tuple of 
values, the value of Pj is t or is f. Case 1: the value of P, is t. Then Q^isP,, 
i.e. Qj is E; and so by general properties of h Qi, • • •, Q m h E, as was to 
be shown since in this case E takes the value t. Case 2: the value of P, is f. 
Then similarly Q, is -iE, and so Q x , ,.., Q m b “iE, as was to be shown. 

Induction step: E is of degree d-\- 1. Then for some proposition 
letter formulas A and B in P 1( ..., P m of degrees ^ d, E is A 3 B or 
A & B or A V B or -i A. Case 1: E is A D B. Four subcases arise, according 
as for the given m-tuple of values of Pi, ..., P m , the formulas A, B take 
the respective values t, t or t, f or f, t or f, f. Subcase 2: A, B take the values 
t, f. Then (by the upper right entry in the table for 3) E takes the value f, 
so we are to show that Q x , ..., Q m b -iE. But by the hypothesis of the 
induction, Q v b A and Q x , ..., Q m b iB. Also, using *41 or *44, 

A, -iB b ”i(A D B); i.e. A, -iB b iE. Hence Qi, ..Q m h “»E, as 
was to be shown. The treatment of the other cases and subcases is 
similar, using *41—*48, *49a. 


Lemma 14. Let E be a proposition letter formula in the distinct propo¬ 
sition letters P x , ..., P m . If for each of the 2 m m-tuples of t’s and f’s, 
Qi. • • •> Qm b E where Q 1( ..., Q m is the corresponding letter m-tuple, 
then Pj V ”n Pj.P m V-iP m b E. 


Proof of Lemma 14. By 2 m ~ 1 + 2 m-2 + ... + 1 applications of 
V-elimination. For example when m = 2, by hypothesis: 


(a) 


Pi, P 2 b E. 
Pi, ~iP 2 b E. 
-iPj, P 2 b E. 
-iPi, -iP 2 b E. 


By two applications of V-elimination: 


1 P x , P 2 V-iP 2 b E. 
\ -iPj, P 2 V-iP 2 b E. 


By one additional application, 


(c) 


Pi V-iPi, P 2 V-.P 2 b E, 


as was to be shown. 


Proof of Theorem 10. Say E is a proposition letter formula in 
Pi, ..., P m . By hypothesis E is identically true, i.e. takes the value t 
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for each wt-tuple of t’s and f’s as values of P 1( .... P m . Then by Lemma 
13, the hypothesis of Lemma 14 is satisfied; so by Lemma 14, 
PiV-iPp ...,P m V-iP m b E. Thence using *51, b E. 

Remark 1. All but the final step of this proof is good for the in- 
tuitionistic system also. Hence in that, for any letter formula E in 
Pi, • . ., Pm" 

(a) Pj V-iPj, ..P m V-iP m b E, if E is identically true. 

(b) PiV- iPj .P m V-iP m b EV-.E. 

The theorem expresses the completeness of the deductive rules of the 
propositional calculus for the purpose of establishing letter formulas 
which are identically true under the arithmetic interpretation; and 
Corollary 1 says that the deductive theory gives a complete account of 
the equality of functions definable in the arithmetic. 

Corollary 2°. The addition to the postulate list for the propositional 
calculus of an unprovable letter formula for use as an axiom schema would 
destroy the simple consistency. 

Proof. By Theorem 10, this letter formula must receive the value f 
for some set of values of the proposition letters which it contains. Select 
such a set of values, and use the new axiom schema by substituting 
Cl V -I Cl for the letters which have the value t, and Cl & -i Cl for those 
which have the value f. The resulting new axiom would be identically 
false. Hence by Corollary 1 it would be equivalent to Cl & -i Cl which is 
also identically false. So (using * 18a) Cl & -i Cl would also be provable, 
and the system would thus be inconsistent (§ 28). 

Corollary 2 says that the propositional calculus is incapable of being 
enlarged by postulates of the same character as those already listed without 
destroying the simple consistency. This is a completeness property of the 
second type described in the introductory remarks. 

Given an wi-tuple of t’s and f’s as values of P x , ..., P m , and letting 
Qi, .... Q m be the corresponding letter w-tuple, we call Q x & ... & Q m 
(Qj V... V Q^, § 27) the corresponding elementary conjunction ( disjunction). 

Example 1 (concluded). The corresponding elementary conjunction 
(disjunction) is Cl & -»S & C (-i Cl V S V -iC). 

Theorem 11 °. A proposition letter formula E in the distinct propo¬ 
sition letters P lt ..., P m is equivalent to a formula F (called a principal 
disjunctive normal form of E) having one of two forms, as follows. If E takes 
the value t for some m-tuples of t’s and f’s as values of Pj, ..., P„„ then F 
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is the disjunction (in some order) of the corresponding elementary con¬ 
junctions. If E is identically false, then F is P x & -1 P x . (Dually , E has a 
principal conjunctive normal form G, in which the roles of V and & and of 
t and f are interchanged ; (- E ~ G). 

Example 2 (continued) 0 , -i [<C? V ® D (2 & C) V -i <C?] has as a p.d.n.f. 
(IH & $ & -iC) V (<C2 & -iS & C) V (c^f & -iB & -iC), and as a p.c.n.f. 
-i^V-iSV-iC&c^V-iSV-iC& c^V-iSVC& c3VSV-iC& c^VSVC. 

Proof. By Theorem 10 Corollary 1, since (as is easily seen from the 
specified form of a p.d.n.f., and the valuation tables for -i, &, V) E 
and F are identically equal. 

The propositional calculus, besides being deductively complete in two 
senses (Theorem 10 and Corollary 2), is also notationally complete in 
the sense that each of the 2% m possible truth-value functions of m variables 
Pj, ..., P m can be represented by a letter formula in those letters. For, 
given the table for the function, we can build a p.d.n.f. (uniquely de¬ 
termined to within the order of its disjunctive members) to represent it. 

These three completeness results appeared first in Post 1921 ; the 
present method of proof of Theorem 10 is due to Kalmar 1934 - 5 . (The 
writer also used it, as an application of V-elimination in another context, 
in a first draft of 1934 .) 

Hilbert and Ackermann 1928 gave a version of Post’s proof, which 
consists in establishing the normal form theorem first, by reduction 
techniques going back in part to the nineteenth century workers on 
symbolic logic. Briefly, a letter formula E can be reduced to a p.d.n.f. 
(or p.c.n.f.), using the chain method § 26, as-follows. First, *58 or *59 is 
used to remove the occurrences of D. Second, the occurrences of -i are 
transferred to the interior by repeated applications of Theorem 8 . Third, 
the resulting expression is "multiplied out” using the distributive law 
*35 (with *33) as in ordinary algebra with V, & in the role of +, • (or 
vice versa, using *36 with *34, for p.c.n.f.). Fourth, simplifications are 
performed, based (besides on *3 1 —*34) on the idempotent laws *37, *38 and 
on *52—*55, so that the resulting disjunctive members (for p.c.n.f., 
conjunctive members) or "terms” contain each at most one occurrence 
of each letter, except if the p.d.n.f. P x dt-iPj (the p.c.n.f. P x V iPJ is 
reached. Fifth, in the non-exceptional case, missing letters are intro¬ 
duced into the terms using *52 and *35 (*53 and *36). Sixth, using 
*31 —*34, *37, *38, the letters and negated letters are brought to the normal 
order within terms, so that the terms become elementary conjunctions 
(elementary disjunctions), and duplications of terms are suppressed. 
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Example 2 (concluded) 0 . Although our proof of Theorem 11 es¬ 
tablishes that V S Z> (S & C) V -i <3] is equivalent to 

(C2&S&-.C) V (<3&-.B&C) V (<3&-iB&-.C), 

the reader may find it of interest to reduce the one to the other by the 
procedure just described. The fourth operation leads to 
(<fT & -iS) V (cAf & -iC) V (B & -iC & <3). — From this by *40 we get as 
a “disjunctive normal form” (not “principal”) (<2?& -nB) V (<£?& -iC), and 
thence using *35 the still shorter equivalent (not a d.n.f.) (iS V-iC). 

§ 30. Decision procedure, interpretation. We know examples 
in mathematics of general questions, such that any particular instance 
of the question can be answered by a preassigned uniform method. 
More precisely, in such an example, there is an infinite class of par¬ 
ticular questions, and a procedure in relation to that class, both being 
described in advance, such that if we thereafter select any particular 
question of the class, the procedure will surely apply and lead us to a 
definite answer, either “yes” or “no”, to the particular question selected. 

Example 1 . Let f(x ) and g(x) be polynomials with given integral 
coefficients. Is f(x) a factor of g(x) ? We can divide g(x) by f(x). This di¬ 
vision is performable step by step by a preassigned method. It will 
terminate in a finite number of steps (how many will depend on how we 
count the steps, but this could be made precise so that the number will 
depend on the degrees of the polynomials and the size of the coefficients). 
We shall then have the remainder before us. This remainder we can rec¬ 
ognize to be either 0 or different from 0. If it is 0, the answer to the 
question is “yes”. If it is different from 0, the answer is “no”. 

Example 2 . Does the equation ax -f by — c, where a, b, c are 
given integers, have a solution in integers for x and y? There is a well- 
known method for answering the question, using Euclid’s algorithm. 

A method of this sort, which suffices to answer, either by “yes” or by 
“no”, any particular instance of a general question, we call a decision 
procedure or decision method or algorithm for the question. The problem of 
finding such a method we shall call the decision problem for the question. 
The problem appears in modern logic with Schroder 1895 , Lowenheim 1915 
and Hilbert 1918 . The present account is only introductory, and we shall 
attempt a more precise definition of what constitutes a decision method 
later (§§ 60, 61). For the present, it will be enough that we should be 
able to recognize particular examples of decision procedures. 
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Similarly, we may have a calculation procedure or algorithm (and hence 
a calculation problem) in relation to a general question which requires for 
an answer, not "yes” or "no”, but the exhibiting of some object. 

Now, in connection with a given formal system, such as the one we 
have been studying, there are some general questions, such as 'Is a given 
formal expression a formula?’ and ‘Is a given finite sequence of formal 
expressions a proof?’, for which a decision method is provided directly 
by the definitions establishing the system. In fact, this must be the case if 
the formalization by means of the system is to accomplish what was 
intended. Of a different nature is the question ‘Is a given formula prov¬ 
able?’ To see this difference, let us compare the definitions of the three 
notions: ‘formula’, ‘proof’, ‘provable formula’. For each of the three 
definitions, in applying it to a particular given object, we have to rec¬ 
ognize that the given object belongs to the class defined (if it does 
belong) through the consideration of a sequence of objects, namely re¬ 
spectively: the formulas obtained on the way in the construction of the 
given formula, the segments of the given proof, the formulas in a proof 
of the given provable formula. In the cases of formula and proof, this 
sequence of objects is contained within the given object, from which it 
can be regained for our consideration. But in the case of provable formula, 
this sequence of objects is not contained within the given object. Hence, 
for the last question, if a decision method exists, it must consist in some¬ 
thing else than a direct or nearly direct application of the definition, and 
the decision problem for this question is not trivial. It is often called 
the decision problem for the formal system. This problem is solved for the 
pure propositional calculus by Theorems 9 and 10 (§§ 28, 29): 

Theorem 12°. A decision procedure (or algorithm) for determining 
whether or not a proposition letter formula E is provable in the propositional 
calculus is afforded by the process of calculating the table for the function of 
t and f represented by E. According as only t’s occur in the value column or 
not, E is provable or not. 

Furthermore, the decision procedure extends to the other senses of 
formula, as we may first pass to a corresponding letter formula by si¬ 
multaneously replacing the distinct prime components by respective 
distinct proposition letters (Theorems 3 and 4 § 25). A decision procedure 
for equivalence is included in that for provability, by the definition of 
equivalence (§26), or can be based similarly on the Corollaries 1. The 
decision problem for deducibility is reduced to that for provability by 
noting, from the derived rules for D and & (Theorem 2 § 23) that, in 
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the propositional calculus, D 1; ..D ( \- E if and only if 
b Dj & .. D ( D E. (A second set of decision procedures is afforded 
by the process of reduction to p.d.n.f., end § 29.) 

Interpretation. Our metamathematical result that the propositional 
calculus admits an arithmetical interpretation, with two objects t, f in the 
arithmetic, illuminates the usual logical interpretation (cf. end of § 28). 
We see that our propositional calculus is a suitable logical instrument ( 1 ) 
when the particular propositions are such that each is definitely either 
true or false, or ( 2 ) when we wish to make it an assumption of a theory 
which we are developing that each is either true or false. For an in- 
tuitionist, Situation ( 1 ) is represented in the mathematics of a finite 
domain of objects, and also in reasoning with propositions about the 
objects of an infinite domain when those propositions are of a type for 
which there are decision procedures (cf. Remark 1 §29). The use of the 
propositional calculus in classical mathematics can be taken as an 
example of Situation (2). 

The truth tables now make it definite exactly how the operators 
of the calculus are to be interpreted as truth-value functions of propo¬ 
sitions. For example, we see that V is the inclusive ‘or’: A V B is 
true, if either A is true or B is true or both. (The exclusive 'or' is expres¬ 
sible thus: (A V B) & (A & B).) 

The implication A D B means the same as -iAVB (*59 § 27), and is 
called material implication. The holding of A 3 B does not require a 
necessary connection of ideas between A and B. For example, the moon 
is made of green cheese materially implies 2 + 2 = 5 (because the prem¬ 
ise is false). Fermat’s “last theorem” materially implies 2 + 2 = 4 
(because the conclusion is true). This is considered paradoxical by some 
writers (Lewis 1912 , 1917 ). Without attempting to enter fully into a 
controversial question, we offer the following brief remarks. The role of 
material implication is best understood, when it is considered in a wider 
context, such as that provided by the full number-theoretic system. 
Vx(A(x) D B(x)) expresses a relationship between A(x) and B(x) as 
variable propositions (or ‘propositional functions of x’), called formal 
implication. The property of material implication of holding whenever the 
first member is false, when the material implication is used in combination 
with generality to build a formal implication, allows a theorem 
Vx(A(x) D B(x)) to hold vacuously for certain values of x. This is a device, 
of a piece with the admission of 0 into the number system and of the 
vacuous set into the theory of sets, which conduces to simpler and more 
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comprehensive formulations of theorems. The same is true of the property 
of material implication of holding whenever the second member is true. 
From the standpoint of lay English, 3 is probably better rendered as 
"if ..., then ..." or "only if”. Notwithstanding, “implication" is a handy 
name for D. In using it, we follow the practice common in mathematics 
of employing the same designation for analogous notions arising in related 
technical theories. (An example is the many different kinds of "addition" 
and "multiplication" in mathematics.) The operator D does have the 
character of an implication in our formal system, in consequence of the 
two properties of it expressed by the deduction theorem and Rule 2 
(i.e. the two D-rules of Theorem 2 ). So it does represent logical conse¬ 
quence, not in some a priori sense, but in the sense defined for the formal 
system by the deductive postulates of the system. 

Other forms of the calculus. Inviewof *56—*61, the notational 
completeness of the propositional calculus (end of § 29) could have been 
obtained by taking -i and only one of the three other operators D, &, V 
as primitive operators for the calculus (formal symbols), and defining the 
other two of D, &, V from those as symbols of abbreviation (as ~ was 
defined in § 26). The still further reduction can be made to a single 
primitive operator | (called ‘alternative denial’, or the 'Sheffer stroke’, 
1913 *), with the table: 



a 

1 B 

s 

t 

f 

a t 

f 

t 

f 

t 

t 


Thence one defines -1 as d \ e2f, and <3 V B as (-« €1) | (- 1 B). 

The propositional calculus may be set up taking the substitution rule, 
which we derived as a subsidiary deduction rule in Theorem 3 § 25, 
instead as a direct postulated rule: 

E 

~E*7 

In this case, the proposition letters are called proposition variables, and 
particular axioms using the formal variables Z4, B, C can be used in place 
of the axiom schemata using the metamathematical variables “A”, 
"B”, “C”. The direct substitution rule is usually construed to apply to 
a single variable at a time (cf. Remark 1 § 25). 
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Example 3. We then have ^ D (2 D (3) as Axiom la. To construct 
thence a proof of A ~D (B D A), for any given formulas A and B, we 
proceed thus. If A does not contain S, first substitute A for < 3 ?, then B 
for S. If A contains $, let P be a proposition letter distinct from C4 and 
not occurring in A, and substitute successively P for B, A for <C7, B for P. 

This is the more usual method of setting up the propositional calculus ; 
the method we have chosen using axiom schemata is due to von Neumann 
1927 . In either case, the rules of inference must have the character of 
schemata, i.e. they must employ metamathematical variables, since 
infinitely many applications have to be provided for. For the calculus 
with the direct substitution rule, while the provability notion is the same 
(as the reader may demonstrate), the deducibility relation is more ex¬ 
tensive. The deduction theorem, and other subsidiary deduction rules 
depending on it, must then carry a restriction on the use of proposition 
variables in the subsidiary deductions, stated with reference to ap¬ 
plications of the new substitution rule exactly like the one we have stated 
with reference to Rules 9 and 1 2 for individual variables in the predicate 
calculus. 

With either of these variations in the way of setting up the propo¬ 
sitional calculus as a formal system (and there exist others), we still have 
essentially the same calculus. Allowing for differences in the selection of 
the operators, the class of provable formulas and the interpretation as 
an arithmetic of two objects remain the same. All these systems, and the 
common theory if we abstract from the particular formulation, may be 
called classical or two-valued propositional calculus. 

Other propositional calculi. There are other systems of proposi¬ 
tional calculus, which are propositional calculi in that they analyze 
propositions only with respect to how they are formed from other propo¬ 
sitions taken as wholes, and which have therefore a definition of formula 
of the same character (to within the particular choice of operators), but 
which differ essentially from the system studied here. 

One class of examples is obtained by generalizing from the two-valued 
propositional calculus to the n-valued propositional calculi for any 
positive integer n ;> 2. These for n > 2 were discussed by Lukasiewicz 
1920 [n = 3) and Post 1921 (any n). They are calculi which can be treated 
on the basis of truth tables in an arithmetic of n objects, as the 
classical system is treated on the basis of two. (Cf. e.g. Lukasiewicz and 
Tarski 1930 , Rosser and Turquette 1945 , 1949 , 1952 .) 

Another example is the intuitionistic propositional calculus (Heyting 
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1930), which is intended as a formalization of intuitionistic mathematical 
reasoning with propositions. As noted in § 23 , we obtain a postulate list 
for it (for the intuitionistic predicate calculus) from our postulate group 
A1 (group A) simply by replacing Axiom Schema 8 by Axiom Schema 8 r . 
This is not Hey ting’s original postulate list, but one suggested by Gentzen’s 
1934-5. (Heyting’s postulates for the predicate calculus are in 1930a.) 
Whether our numbered results and italicized theorems marked with the 
symbol “°” as not being established intuitionistically, so far as our dis¬ 
cussion has gone, actually do not hold for the intuitionistic system is a 
question which in each case requires further consideration. We shall return 
to this later in the book (§§ 80 , 82 ). The result of this section, that a 
decision procedure exists for the propositional calculus, is one such result 
which does hold for the intuitionistic system (Theorem 56 (d) § 80 ). 
It is known that in the intuitionistic propositional calculus none of the 
four operators can be expressed in terms of the remaining ones (Wajsberg 
1938, McKinsey 1939); and that the calculus cannot be treated on the 
basis of truth tables for any finite n (Godel 1932), but can be for 
n = N 0 (Jaskowski 1936*). 

Still further apart from the standpoint of the classical calculus studied 
here are the propositional calculi of strict implication (Lewis 1912), and 
the modal propositional calculi which deal with ‘possibility’, ‘necessity’, 
etc. (Cf. Lewis and Langford 1932, Feys 1937*8, McKinsey and Tarski 
1948, Feys 1965*.) 



Chapter VII 


THE PREDICATE CALCULUS 

§31. Predicate letter formulas. In this chapter, we study the 
part of the formal system obtained by using exactly the postulates of 
Group A. 

The propositional calculus, studied in the preceding chapter, is a 
formalization of logical relationships which depend only on the analysis 
of the way certain propositions are composed out of simpler propositions, 
using operations of composition in which the simpler propositions enter as 
unanalyzed wholes. 

In the predicate calculus, the analysis goes a step further, and we are 
allowed to consider also what may be called the ‘subject-predicate’ 
structure of the simpler propositions, and to use operations of com¬ 
position which depend on that structure. 

This analysis still does not take into account all features of the structure 
of number-theoretic propositions. This we can emphasize, just as in the 
preceding chapter, by introducing an alternative notion of formula, 
which eliminates the irrelevant details of the number-theoretic def¬ 
inition of formula, and leaves the way open to other applications as well. 

We start by introducing formal expressions of a new species, con¬ 
stituting a generalization of the proposition letters introduced in § 25 , 
as follows, 

d, l1{a), d{a, b ), ..., B, B(a), B{a,b), ..., C,C{a), C(a,b), - 

These expressions we call predicate letters (with attached or name form 
variables). Each of the symbols formerly used as a proposition letter forms 
a different predicate letter with each different number n ;> 0 of attached 
variables, and for n = 0 a predicate letter is a proposition letter. The n 
attached variables may be any n distinct variables. Different choices of 
the n attached variables with a given predicate letter are said to give 
different name forms of the same predicate letter, e.g. <^(a, b), <3(b, a) 
and <C?(c, d) are three name forms of the predicate letter formed by using 

with two attached variables, but Zi(a) and <3(a, b, c) are other pred- 
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icate letters, and < 3 l(a, a) is not a predicate letter. It ordinarily suffices 
throughout a discussion to consider each of the distinct predicate letters 
employed in the discussion as represented by just one name form, i.e. 
as taken with a fixed sequence of n attached variables throughout the 
discussion; and usually we take these n attached variables to be the first 
n variables in order from a given infinite list a x , a 2 , a 3 , ... of variables 
(usually here the list a, b, c, ...). 

For the definition of predicate letter formula, given inductively as fol¬ 
lows, the only terms shall be the variables. However we elect to say 
"term” at some places and "variable” at others, so that it will be clear 
how the discussion generalizes, if we later wish to allow a wider class of 
terms than simply the variables. 

1. If P(a 2 , ..., a„) is a predicate letter with attached variables, and 
fj, ..., t n are terms, then P(t 1; ..., t n ) is a formula. 2 — 5 . If A and B 
are formulas, then (A) D (B), (A) & (B), (A) V (B) and -i(A) are formulas. 
6 — 7 . If x is a variable, and A(x) is a formula, then Vx(A(x)) and 3 x(A(x)) 
are formulas. 8 . The only formulas are those given by 1 — 7 . 

Example 1 . By 1 , 3 l(b,a ), B, 3 i{a, b) and 31 (a, a) are predicate 
letter formulas. Here it suffices to start from the two name forms <31 {a, b) 
and B. By successive applications of 3 and 2 (omitting parentheses under 
the usual conventions § 17 ), 31 (b, a) & B and c 1 (b, a) & S 3 31 (a, b) are 
predicate letter formulas. Finally by 6, Vb( 3 l(b, a) &B D 32 {a, b)) is a 
predicate letter formula. 

For this chapter, when we say "term” and "formula” without specifying 
the senses, the words may be understood either in the respective senses 
of free variable and predicate letter formula, or else in their respective 
number-theoretic senses (§ 17 ). The two formal systems, having in com¬ 
mon Group A (§ 19 ) as postulate list, but differing thus in their formation 
rules, we distinguish as the pure predicate calculus and the number- 
theoretic predicate calculus. 

Before going ahead with the metamathematics, let us see how the 
formalism is interpreted as a calculus of predicates. 

In word languages, a proposition is expressed by a sentence. Then 
a ‘predicate’ is expressed by an incomplete sentence or sentence skeleton 

containing an open place. For example, "_ is a man” expresses a 

predicate. When we fill the open place with the name of a subject such as 
“Socrates”, a sentence such as "Socrates is a man” is obtained. The 
situation is conveniently described by using the modern mathematical 
notion of a ‘function’ (§ 10). The predicate is then a function of one vari- 
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able. This variable ranges over some domain, including as members 
Socrates, Chiron, etc. To each member of this domain, the function cor¬ 
relates a proposition; i.e. when the independent variable takes a member of 
the domain as value, the predicate takes a proposition as corresponding 
value. Thus the predicate is a propositional function of one variable. 
Predicates are often called ‘properties’, e.g. in Chapters II, III. In this 

terminology “_ is a man” expresses the property P of being a man, 

and ‘‘Socrates is a man” expresses the proposition that Socrates has the 
property P. Another term used in this connection is 'class'; “Socrates 
is a man” expresses that Socrates belongs to the class C of men. (‘Pred¬ 
icate’ in its strict grammatical meaning of that which a sentence says 
about its subject is narrower than ‘propositional function of one vari¬ 
able’ or ‘property’, since for a predicate the omitted noun in the sentence 
skeleton must be the subject of the sentence.) 

As a second illustration, consider the sentence skeleton _loves_ 

In grammatical terminology, this consists of a transitive verb and two 
open places, one to be filled by the name of a subject such as “Jane”, 
and the other of an object such as “John”. Whether the resulting sentence 
expresses a true or a false proposition we are not saying. The sentence 
skeleton in this illustration expresses a binary relation, i.e. a relation 
among two members, or in other words a propositional function of two 
variables. 

In other examples, there may be several correlated open spaces to be 

filled with the same name. For example, “_ t is the father of_ 2 , or 

_ i is the mother of_ 2 ” expresses a binary relation, also expressed by 

“_! is a parent of_ 2 ”. 

The reader may easily make up examples of sentence skeletons con¬ 
taining any greater number n of open places or of sets of correlated open 
places. Such a sentence skeleton expresses an n-ary relation, or propo¬ 
sitional function of n variables. 

From the standpoint of functions, the distinction between a ‘predicate’ 
in the traditional sense of the first illustration and a ‘relation’ is of minor 
significance; and likewise the distinction between ‘subject’ and ‘object’ 
in the first two examples. It will be more convenient henceforth to say 
simply “predicate” and "object” in all cases. By a predicate (of n variables) 
we shall accordingly mean a propositional function of n variables, where « 
may be 0 giving a proposition, or 1 giving a predicate in the traditional 
sense or a property, or > 1 giving an n-ary relation. We call the values 
of the independent variables (when n > 0) objects, and the independent 
variables object variables. 
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The predicate calculus will treat of the logic of predicates in this general 
sense of 'predicate’, i.e. as propositional function. Some writers hence say 
"functional calculus” instead of "predicate calculus”. 

We shall consider here only the case of predicate calculus which has 
one domain of objects for all its object variables, in which case the objects 
may be called also individuals, and the object variables individual 
variables. This case suffices for the intended application to our number- 
theoretic system, for which the domain is the set of the natural numbers. 

The treatment of the predicate calculus will not depend on any sup¬ 
position about the object domain, except that it be non-empty, i.e. 
contain at least one element. For the pure form of the calculus, no pro¬ 
vision is made for referring to particular objects of the domain, i.e. there 
are individual variables but no individual constants. 

Now let us see how we come to choose the symbolism which we use 
to represent predicates. The blanks employed above to show the open 
places in a sentence skeleton we replace by the device customary in 

mathematics of letters called “variables”. Thus instead of "_j is 

father of_ 2 , or_i is mother of_ 2 ”> we more conveniently write 

(1) " a is father of b, or a is mother of b” . Some mathematical examples 
are ( 2 ) “a is even”, ( 3 ) “a equals b”, and ( 4 ) “a is less than b”. 

Furthermore, since a predicate is a kind of a function, namely one 
whose values are propositions, we employ functional notation (§ 10) in 
naming predicates, except in cases when some other notation is in com¬ 
mon usage. Thus we might designate the predicate of (1) as “P{a, b)” 
(for “a is a parent of b”) and of ( 2 ) as “E(a)”, using the functional notation 
with the predicate symbol (“P” or “E”) written ahead of the independent 
variables. (We have already done so in § 7 in using " P(n )” to express that 
n has the property P.) For ( 3 ) and ( 4 ), we use the customary relational 
notations "a—b” and " a<b”, with the predicate symbol ("=” or 
“<”) written between the independent variables. 

For the pure predicate calculus, the predicate letters such as E3(a, b), B, 
etc. are to be interpreted as standing for unspecified predicates, i.e. 

b) for a predicate of two variables, B for a predicate of zero variables 
(i.e. a proposition), etc. Then any predicate letter formula can be inter¬ 
preted as standing lot a predicate which is determined by the predicates 
represented by the distinct predicate letters from which it is constructed, 
e.g. Vb(El(b, a) & B D <3(a, b)) represents a predicate of one variable 
(corresponding to the free a) determined by the predicate of two variables 
represented by c ~ 2 {a, b) and the proposition represented by B. 

Note that when we are using HX{a, b) as the name form for the predicate 
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letter <3 with two attached variables, then for the interpretation, after 
choosing independently what predicate FA{a, b) shall stand for, the 
meanings of <Fi{c, b), (a, a), < 3 {b, a), etc. depend upon that, by the 

standard convention for functional notation (§ 10). 

Similarly, any formula in the number-theoretic system can be inter¬ 
preted as expressing a predicate, under the usual number-theoretic 
meanings of the symbols. For example, 3 c{a= 0 ”-c) expresses E(a) 
or a is even, a=b expresses a=b, and 3 c(c' J r a=b) (abbreviated a<b 
in § 17 ) expresses a<b. 

Let Xj,..., x B be distinct variables, and A(x 1 , ..., x„) a formula (under 
either notion of formula). When we are interpreting A(x v ...,x„) by 
a predicate, or performing formal operations with it which are in keeping 
with an interpretation by a predicate (even though the interpretation is 
not involved in the formal operations), we call A(x lf ...,x n ) a name 
form in x lt ..., x„ as the name form variables, and say that x x ,...,x n 
have the name form interpretation or the predicate interpretation. The 
name form A(x lf ...,x„) is the formula of the system; "A(x 1 , .. .,x n )” 
is our metamathematical name for that formula (under our substitution 
notation § 18 ); and we may on occasion introduce “A{x lt ..., x n )” as a 
name for the predicate A{x lt . .., x n ) which the formula A(x p ..., x n ) 
expresses under the interpretation. 

It is natural to interpret a formula with free variables by a predicate, 
e'.g., when we are concerned with the formation rules of the system, and 
the formula in question is being considered as a constituent of other for¬ 
mulas. A discussion of the interpretations of a formula by a proposition 
will be given at the end of § 32 . 

§ 32 . Derived rules, free variables. In using the derived rules 
of Theorem 2 (§ 23 ) for the predicate calculus, we must observe carefully 
the restrictions on the handling of variables: (1) The t for an V-elimination 
or 3 -introduction must be free for the x in the A(x) (cf. § 18 ). (2) The x for 
an 3 -elimination must not occur free in the C. ( 3 ) In a subsidiary de¬ 
duction the free variables must be held constant for the assumption 
formula to be discharged (cf. § 22). At first parenthetical remarks will 
call attention to the precautions taken. Later it will be left increasingly 
to thfe reader to observe them. 

Theorem 13 . For * 64 —* 68 , let x x , ...,x n be distinct variables, 
A(xj, .... x„) be a formula, and t 1 , . . ., t n be terms {not necessarily distinct) 
which are free for x v ...,x„, respectively, in A(x x , ...,x n ). For * 67 , 
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also let C be a formula not containing any of x v ...,x n free, and r(x 1 , .,,, x„) 
be a finite sequence of zero or more formulas, and suppose that the free 
variables are held constant for the last assumption formula A(x l , ..., x n ) 
in the subsidiary deduction. Then : 

*64. A(x x , ..., x B ) b Xl '" x ” Vx x . .. Vx„A(x 1 , ..., x„). 

*65. Vx x ... Vx„A(x 1 , .... x„) b A(t xi ..., t„). 
*66. A(x x , ..., x„) b Xl '" Xn A(tj, ..t n ). 

*68. A(t x , ..., t w ) b 3x x ... 3x n A(x x , ..., x„). 

*67. If T(x x , ..x„), A(x x , ..., x n ) b C, then 
r(x x , ..x„), 3x x ... 3x n A(x x , ..., x n ) b Xl "‘ x " C. 
(n-fold V-introduction, V-elimination, substitution, 3-introduction 

and 3-elimination.) 

If x is a variable, and A(x) and B(x) are formulas: 

*69. A(x) D B(x) b x VxA(x) D VxB(x). 

*70. A(x) D B(x) b x 3xA(x) D 3xB(x). 

Proofs. *64. By n successive applications of the simple V-in¬ 
troduction rule (§ 23). 

For *65 and *66 (and later *68), we make two cases. Case 1 : t x , ..., t n 
do not contain x x , ..., x n . Case 2: otherwise. 

*65 Case 1 . By n-successive simple V-eliminations, 

Vx x ... Vx„A(x x , ..., x„) b Vx 2 ... Vx„A(t x , x 2 , .... x„) b 
Vx$ ... Vx n A(t x , t 2 , Xg, ..., x B ) b • • • b ^^nA(tj, . *., t w—x , x n ) b 
A(t x .t„). 

The case hypothesis and the hypothesis of the theorem that t x , ..., t„ 
are free for x x , ...,x n , respectively, in A(x x , ...,x„), together insure 
that t x is free for x x in Vx 2 ... Vx n A(x x , x 2 , ..., x„), t 2 for x 2 in 
Vx 3 ... Vx n A(t x , x 2 , x 3 , ...,x n ), etc. (Case 2 will follow *66 Case 1.) 

*66 Case 1 . By combining *64 and *65 Case 1 (or by n successive 
applications of the simple substitution rule § 23). 

*65 Case 2. Let w x , ..., w„ be variables distinct from each other 
and from x x , .... x„, and not occurring in A(x x , ..., x„) or t x , ..., t„. 
Then, by Cases 1 of *65 and *66, 

Vx x ... Vx„A(x x , ..x„) b A(w x , ..., w n ) b Wl " -tr " A(t x , ..., t„). 

*68 Case 2 . A(t x , ..., t„) b 3w x ... 3w„A(w x , ..., w„). Also 
A(w x , ..., w„) b 3x x ... 3x„A(x x , ..., x„), whence by *67, 

3 w x ... 3w„A(w x , ..., w„) b 3x x ... 3x n A(x x , ..., x„). 
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*69. 1. A(x) D B(x), A(x) b B(x) b x VxB(x) — D-elim., V-introd. 

2. A(x) D B(x), VxA(x) l- x VxB(x) — V-elim., 1 [the term x is 
free for x in A(x) by use of the definition § 18]. 

3. A(x) D B(x) j- x VxA(x) D VxB(x) — D-introd., 2 [the variable 
x is held constant in the subsidiary deduction 2 for the as¬ 
sumption formula VxA(x) which is being discharged, since x 
does not occur free in VxA(x)]. 

*70. 1. A(x) D B(x), A(x) b B(x) b 3xB(x) — D-elim., 3-introd. [cf. 
*69 Step 2]. 

2. A(x) ID B(x), 3xA(x) b x 3xB(x) — 3-elim., 1 [3xB(x) does not 
contain x free, and x is held constant in the subsidiary deduction 1 ]. 

3. A(x) D B(x) b x 3xA(x) D 3xB(x) — D-introd., 2 [x does not 
occur free in 3xA(x)]. 

The 3-elimination rule required at Step 2 for *70 is a subsidiary deduction 
rule, while the V-elimination rule used at Step 2 for *69 was established in 
the stronger direct form (although under an abbreviation introduced in 
§ 23 Example 1 the step is presented in the same format). Hence the 
greater care required in justifying Step 2 for *70. 

Example 1. V-introduction can be applied to the formula £l(a, a) 
in only one way, while 3-introduction (simple or 2-fold) can be applied in 
several ways, as follows. 

C4(a,a) \- a Va^l(a,a) — V-introd., letting x be a ; A(x) be <37(a,a). 

<£l(a,a) b 3a£l(a,a )— 3-introd., letting x be a; A(x) be d(a,a ); t be a. 

d(a,a) b 3 besf(a,b )— 3-introd., letting x be h\ A(x) be <37(d,/>); t be a. 

^(a,a) b 3 b<3(b,a )— 3-introd., lettingxbe b; A(x) be <37(/>, a); t bed. 

<3(a,a) b 3a3b^2(a,b) — 2-fold 3-introd., letting x 1( x 2 be a, b; A(x x , x 2 ) 
be <3(a,b ); t x , t 2 be a, a. 

c%(a,a) b 3a3b<Pl(b,a) — 2-fold 3-introd., letting x x , x 2 be a, b\ A(x x , x 2 ) 
be <37 (b,a ); t x , t 2 be a, a. 

Interpretation of formulas with free variables. The restriction 
for our subsidiary deduction rules of Theorem 2, that the free variables 
should be held constant for each assumption formula to be discharged, 
can be illuminated by some remarks on the interpretation, which of 
course are not part of the metamathematics. 

In Example 2 following, we have in Step 1 a deduction from the as¬ 
sumption formula b^O with b held constant, so that D-introduction is 
applicable at once. In Example 3, D-introduction is not applicable at 
once because b is varied, although it can be applied after an V-elimination. 
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In Example 4, we see how a false result (for the number-theoretic inter¬ 
pretation) is obtainable, if we violate the restriction on subsidiary de¬ 
duction. 

Example 2. 1. b^O b a+b^a — can be established, holding b 

constant, in the number-theoretic system (§39; “s^t” abbreviates 
-is=t, § 17). 

2. b b^O a+b^a — D-introd., 1. 

3. b Vb(b^0 D a-\-b^a) — V-introd., 2. 

Example 3. 1. b=£0 b* 0^0 — substitution (§ 23), b being varied. 

2 . Vb{b^0) b 0^0 — V-elim., 1. 

3. b Vb(b^O) D 0^0 — D-introd., 2. 

Example 4. 1. b^0 b^ 0^0 — same as Example 3 Step 1. 

2? b b^0 D 0^0— D-introd., misapplied to 1. 

3? b Vb(b^ 0 D 0^=0) — V-introd., 2. 

4? b 0V0 D 0^0 — V-elim., 3. 

5. b O'— by substitution in Axiom 15. 

6 ? b 0^0 — Rule 2, 5, 4. 

The only violation of the formal rules in Example 4 is at Step 2. It is 
suggested that the reader undertake himself to explain the fallacy in 
terms of the interpretation, before reading the further discussion below. 
Especially he should note the difference between the formulas at Step 3 
of Examples 3 and 4. The reader may also supply the formal details 
for the following two examples, and compare the results. 

Example 5. Given A(x) b B and A(x) b ~iB with x held constant, 
then b -| A(x) and b Vx-iA(x). 

Example 6 . Given A(x) b x B and A(x) b x ~iB with x not neces¬ 
sarily held constant, then b -iVxA(x). 

The rules of V-elimination and 3-elimination may be discussed in like 
manner. 

In informal mathematics, we know two different ways of using free 
variables in stating propositions, as illustrated in algebra by an identical 
equation (x + y) 2 = x 2 + 2 xy + y 2 and a conditional equation x 2 -j- 2 = 3x. 

The first of these interpretations is the one which applies to the free 
variables in the axioms and formal theorems of our system, and we 
call it the generality interpretation. For example. Axiom 14, which is 
a'=b' D a=b, means that, for every pair a and b of natural numbers, 
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if a' = b', then a = 6; and the formal theorem a=a, proved in Example 
1 § 19, expresses that every natural number equals itself. 

But when a formula A(x) with a free variable x is taken as an assump¬ 
tion formula for a formal deduction, we have a choice. We may intend the 
assumption in the sense “Suppose that, for all *, A (x)”, so that x has the 
generality interpretation. Or we may intend the assumption in the sense 
“Let x be a number such that A(x)”, in which case we say x has the con¬ 
ditional interpretation. 

In the second case, the use in the deduction of operations which depend, 
considered in terms of the meaning, on the possibility of allowing x to 
range over the object domain will be out of keeping with the interpreta¬ 
tion. For the generality interpretation, there is no such limitation. Step 1 
Example 2 is a deduction constructed in keeping with the conditional 
interpretation; and Step 1 Examples 3 and 4 only with the generality 
interpretation. That the assumption formula b^O is false under the 
latter interpretation is beside the point here, and the concluding formula 
of Example 3 is quite correct (although not very interesting). 

The student of elementary mathematics is acquainted with the dis¬ 
tinction between symbols classified as constants and symbols classified as 
variables. Close inspection shows that the distinction in the use of the 
symbols is always relative to a context. A given symbol is introduced as 
name for an object, and throughout a certain context every occurrence 
of the symbol is as name for the same object. From outside the context, 
it is indicated that the object may be any one (some one, etc.) of the 
members of some set. Fundamentally then, the symbol is constant, i.e. 
its meaning cannot be changed, within the context, while from outside 
the context it is variable. 

(The terminology actually employed in a given theory is generally that 
suitable to the context constituted by the theory as a whole. Sometimes 
symbols which are constant throughout an important subcontext, but 
variable for the theory as a whole, are called “parameters” or “arbitrary 
constants”.) 

For the generality interpretation of a variable x in a formula A(x), the 
context within which all (free) occurrences of x must represent the same 
object is exactly the whole formula A(x). The formula A(x) then means 
the' same as VxA(x), and in analogy to the scope of the quantifier Vx, we 
also call A(x) the scope of the generality expressed by the free variable x. 

For the conditional interpretation, the context within which all (free) 
occurrences of x have the same meaning is not just A(x) but the whole 
deduction from A(x) (or the part of it dependent on A(x)). 
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In Example 4 Step 1, under the generality interpretation the scope 
of the generality expressed by b is exactly the assumption formula b ^0. 
Were the formula of Step 2 provable, the generality interpretation would 
then apply to it as a whole, and the scope would become b=£ 0 D 0^0, 
not just the part b ^ 0. 

The universal quantifier Vx functions in our logical symbolism as 
a device to restrict the scope of generality to a part of a formula. The 
formulas of Steps 2 and 3 Example 4 (or of the two conclusions of Example 
5) are synonymous under the generality interpretation of the free variable 
in the first. No formula without a quantifier can be written which is 
synonymous with the formula of Step 3 Example 3 (or of the conclusion 
of Example 6). 

Let A be a formula containing free exactly the distinct variables 
x x , ..., x n in order of first occurrence. According as n > 0 or n — 0, 
we call A open or closed. The closed formula Vx x ... Vx n A (sometimes 
abbreviated “VA”) we call the closure of A. By *64 and *65, A and VA are 
interdeducible (i.e. each can be deduced from the other), with x x , ..., x n 
varied in the deduction of VA from A. Under the generality interpretation 
A and VA are synonymous. 

§ 33. Replacement. Let C A be a formula in which there is a specified 
occurrence of a formula A, and let C B be the result of replacing this 
occurrence by B (§ 26). There will then exist parallel constructions of 
from A and of C B from B by Clauses 2 — 7 of the definition of formula 
(§ 17 or § 31). The number of steps in this construction of C A (or of oper¬ 
ators in the scopes of which the part A lies) we call the depth of the part 
A in C A . 

Example 1 . Let A be £Z(b,a), C A be Vi>(<fT(i>, a) &B 3 <H(a, b)) 
(the specified occurrence here being the only occurrence), and B be 
3d<Zl(d, a, c). Then C B is 'ib(^ddd(d, a, c) & B D <^(a, b)). The parallel 
constructions of C A from A and of C B from B are as follows, and the 
depth is 3. 

<3( b , a) 3 ddX{d, a, c) 

^{b, a) & B 3 d^(d, a,c)&B 

c Z(b, a)& SD b) 3da(d, a,c)&Bz> <3{a, b) 

Vb^ib, a) & B D a { a , b)) Vb{lddX{d, a,c)8cBz> <3{a, b)) 

Theorem 14. If A, B, C A and C B are formulas related as in the foregoing 
discussion of replacement, then 

A ~ B h x ‘- x » C A ~ C B 
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where x x , ..., x n are the free variables of A or B which belong to a quantifier 
of C A having the specified, occurrence of A within its scope. (Replacement 
theorem.) 

In other words, Xj, ..., x n are the free variables of A ~ B which 
are quantified in the construction of C A from the specified part A. Proof 
of the theorem is by induction on the depth as before (Theorem 6 § 26), 
using now two additional lemmas. 

Additional lemmas for replacement. If x is a variable, and A(x) 
and B(x) are formulas: 

*71. A(x)~B(x) b x VxA(x) ~VxB(x). 

*72. A(x)~B(x) b x 3xA(x) ~ 3xB(x). 

Proofs. From *69 and *70, respectively. 

Example 1 (concluded). Let ~ be written between the formulas 
of each pair in the parallel columns. The resulting formulas are deducible, 
each from the preceding, using successively *28a, *26and*71 (varying/;). 

Corollary 1 . Under the conditions of the theorem: 

If b A B, then b C A ^ C B . 

Example 2°. By *49 (and *20), b A(x) ~-i-iA(x). Hence 
b tiVxA(x) ~ -i-iVx-i-iA(x). 

Example 3. Now (cf. Example 2 § 26), if A and B do not contain 
x free: A ~ B b A V Vx(A D C(x)) ~ A V Vx(B D C(x)). If A and B 
may contain x free: A ~ B b x A V Vx(A D C(x)) ~AV Vx(B D C(x)). 
If b A~B, then b A V Vx(A D C(x)) ~ A V Vx(B D C(x)). 

Corollary 2 . Under the conditions of the theorem: 

A ~ B, C A b* 1 '"*” C B , with x x , ...,x n varied only for the first as¬ 
sumption formula. If b A^B, then C A b C B . (Replacement property of 
equivalence.) 

A replacement may be preceded by a substitution for individual 
variables (* 66 ). 

Example 4. b + 0 =a ~ a=b b* b '+ 0 =a ~ a = b ’ b A 
3 b ( b '-\- 0 — a ) ~ lb ( a = b ’). But in § 38 we will have b b -{- 0 =a ~ a = b . 
Hence then b 3 b ( b '-\- 0 = a ) ~ 3 b ( a — b '). 

The interpretation illuminates the treatment of the variables in re¬ 
placement. In informal mathematics, knowing that sin x is the same 
function as cos (tt/2 — x), i.e. having sin x = cos (tc/2 — x) as an identical 
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equation (with * taking the generality interpretation § 32), we are justified 
in replacing "sin x” by "cos (n/2 — x)” in “/£ sin x dx” , and in replacing 
"sin 2x” by "cos (tc/2 — 2x)”. But assuming the conditional equation 
sinr= \ —x gives us no right to replace "sin x” by “l—x” in 
'To s i n x or to replace "sin 2x” by "1 — 2x”. 

Change of bound variables. Two formulas A and B will be said 
to be congruent, if A and B have the same number k of symbols, and 
for each i (i — 1 , . .., k) : (I) If the i-th symbol of A is not a variable, 
then the t-th symbol of B is the same symbol. (II) If the *-th symbol of 
A is a free occurrence of a variable, then the *-th symbol of B is a free 
occurrence of the same variable. (Ill) If the t-th symbol of A is an oc¬ 
currence of a variable bound by the j- th quantifier of A, then the i-th 
symbol of B is an occurrence of a variable (not necessarily of the same 
variable) bound by the j- th quantifier of B. 

In brief, two formulas are congruent, if they differ only in their bound 
variables, and corresponding bound variables are bound by corre¬ 
sponding quantifiers. 

Example 5. The two following formulas are congruent: 

Va{3{a, c) V 3 aS{a) D 3 bC(a, b)), 'ib{a{b, c) V 3 cB{c) => 3 aC{b, a)). 
This can be made apparent by introducing indices to show which oc¬ 
currences of variables are bound by the same quantifier: 
Va 1 (£l(a 1 s:)V3af(aj33bj:{a l ,bJ), 'ib l {&{b v c) V3 c 2 S(c 2 )D 3 a 3 C(b v a 3 )). 
If we erase out the bound variables themselves, leaving blanks numbered 
with the indices, identical expressions will be obtained. 

Lemma 15a. If x is a variable, A(x) is a formula, and b is a variable 
such that (i) b is free for x in A(x), and (ii) b does not occur free in A(x) 
( unless b is x), then : 

*73. H VxA(x) ~ VbA(b). *74. 3xA(x) ~ 3bA(b). 

Proofs. *73. By Example 3 § 22 (Steps 1—2), (- VbA(b) 3 VxA(x), 
and similarly \- VxA(x) D VbA(b). — Note by Example 9 §18 that 
(i) and (ii) are necessary and sufficient in order that VbA(b) be congruent 
to VxA(x) (or 3 bA(b) to 3xA(x)). 

Lemma 15b. Congruent formulas are equivalent ; i.e. if A is congruent to 
B, then 1 - A ~ B. 

Proof. Let A contain in order exactly r quantifiers with respective 
variables u x , ..., u r (not necessarily distinct). Let w x , ..., w r be distinct 
variables occurring neither in A nor in B. Let C come from A by changing 
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each u, (j — 1, ..., r) to w* in exactly the occurrences which the /-th 
quantifier binds. By r successive replacements, using *73 or *74 and 
Corollary 1 Theorem 14, |- A ~ C. Similarly, h B~C. Hence (*20, *21), 

(- A'—' B. 

Example 5 (concluded), b 'ia{LL{a, c) V 3aB{a) 3 3 bC{a, b )) ~ 
W(eT(</, c) V 3aB{a)Z>3bC{d, b)) [*73] ~ Vd{Tl{d,c) V 3eB{e)=>3bC{d, b)) 
[*74] ~ W(<3(</, c) V 3eB(e) 3 3fC{d, /')) [*74], Similarly 
b Wb{^{b, c) V 3c2(c) 3 3 aC{b, a)) ~ Vd{Tl{d, c) V 3eB{e) 3 3 fC{d, f)). 

Remark 1 . (a) Similarly to Theorem 14, by using *6 — *9b, *12, *69, 

*70 (instead of *26—*30, *71, *72) as the lemmas: Let the part A stand in C A 
within the scopes of only certain of the symbols 3, &, V, —i, V, 3. Then 
A 3 B b Xl '" x ” C A 3 C B or B 3 A b Xl '" x " C A 3 C B in the system having 
as postulates only the 3- postulates and the postulates for the symbols in 
question, provided that in case the symbols include V but not & the V -postulates 
include Axiom Schema 9a of Lemma 11 § 24. (Cf. Herbrand 1930 § 3.2, 
MacLane 1934 pp. 28 ff., Curry 1939 pp. 290—291.) 

(b) b VxA(x) 3 VbA(b) and b VbA(b) 3 VxA(x) {similarly with 3) ) 
using only Postulates 9 and 10(11 and 12). (c) Therefore: If A is congruent 
to B, then b A 3 B and b B 3 A {and hence A and B are interdeducible) 
using only the 3- postulates and the postulates for {at most) the logical 
symbols which A contains, provided as in (a). 

Permanent abbreviations. Our use of permanent abbreviations, 
such as “a<b” (discussed at the end of § 17) will differ from that of 
temporary abbreviations, such as “A(x)”, "A(x lt ...,x n )”, etc. (§ 18), 
in two respects. First, they shall not contain free variables not shown in 
the abbreviation (‘anonymous free variables’). Second, instead of avoiding 
the substitution of terms not free at the substitution positions, we permit 
the bound variables suppressed by the abbreviation (‘anonymous bound 
variables') to be chosen at will to make whatever terms we wish to sub¬ 
stitute free at the substitution positions. All legitimate unabbreviations 
of a given abbreviation are congruent, and hence by Lemma 15b equiva¬ 
lent. Thus it is immaterial in considering questions of deducibility and 
provability which legitimate unabbreviation is used. 

For the question whether a postulate applies, the manner of un¬ 
abbreviating may make a difference, e.g. s<t 3 (B 3 set) is an axiom 
by Schema la only if both occurrences of “s<t” are unabbreviated alike. 
Hereafter in our statements that a postulate applies we shall be tacitly 
supposing that like appearing abbreviations are unabbreviated alike. 
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*§ 34. Substitution. The use of a formal substitution rule for 
predicate letters can be largely avoided by stating results in the schematic 
form, with metamathematical letters instead of particular predicate 
letters. We then substitute informally in applying the results with a 
change in the signification of the metamathematical letters, but this 
substitution does not constitute application of a formal substitution rule. 
We have been doing this continually, from the very beginning of our study 
of the formal system. One new example is given to show what is meant. 

Example 1°. As *83 (§ 35) we will establish that, if x is any variable 
and A(x) any formula, p 3xA(x) ~ -iVx-iA(x). Now let x be any 
variable and A(x) any formula. By taking the negation -iA(x) of this 
A(x) as the A(x) of *83, we have (- 3x -i A(x) ~ -iVx-i^iA(x). This 
and Example 2 § 33 explain the second and third steps of the following 
chain: (- VxA(x) ~ -i-iVxA(x) [*49] ~ -i-iVxt-iA( x) [*49] 

~ -i3x-iA(x) [*83]. 

The only essential use we shall make of the formal substitution rule 
for the predicate calculus (Theorem 15) is in establishing duality (Corol¬ 
lary Theorem 18 §35), where we substitute negations of the predicate 
letters for the letters. That substitution can be justified by reasoning 
already used in proving the substitution rule for the propositional calculus 
(Theorem 3 § 25), with no new complications. A further application of 
formal substitution occurs in passing from a number of results, first proved 
by duality in terms of particular predicate letters, to the general results of 
the same form with metamathematical letters. That application could 
be avoided by using the substitution rule only heuristically, to discover 
proofs that we can afterwards validate without use of it. The reader may 
therefore, if he wishes, omit the detailed treatment of substitution given 
in the remainder of this section. 

A generalized notion of ‘occurrence’ is appropriate to the name form 
interpretation. In informal mathematics, “sin x” as an expression for a 
function occurs in “3 sin x + cos x”, in “/<{ sin x dx" and in “cos x sin 2x”, 
although as an expression for a number it occurs only in the first. 

To simplify the notation in this section, we shall analyze each sub¬ 
stitution taking the attached variables for each of the distinct predicate 
letters to be the first n variables from an infinite list a 1; a 2 , a 8 , ... of 
variables (cf. § 31). However this list may have to be chosen differently 
for different substitutions (see below). 

By an occurrence of a predicate letter P(a 1; ..., a„) with attached 
variables in a predicate letter formula E we shall mean a (consecutive) 
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part of E of the form P(t 1 , ..., t„) where tp ..., t n are terms. A predicate 
letter formula E is said to be a predicate letter formula in the distinct 
predicate letters 

(1) P x (ap ..a^), ..P m (a 1 , ..a»J n m ^ 0; m ^ 1 ), 

if no predicate letters other than ( 1 ) occur in E. 

Example 2. The predicate letter <^(a, b) occurs twice in a) 

& B D c il{a, b)), first as the part c%(b, a) and second as the part <3(<x, b). 
The formula V/>(J?(/>, a) & B ID <^{a, b)) is a predicate letter formula in 
^(a, b), B, C(a, b, c). 

The substitution of formulas (in either sense §31) 

(2) A 1 (a 1 , > • Rn^) t • • • > A m (ap • * * i <bi m ) j 

considered as name forms in the respective variables shown, for the pre¬ 
dicate letters ( 1 ) in E (with result E*) shall consist in replacing, simul¬ 
taneously for each j (j = 1 each occurrence P,(t 1( ...,t w? ) of 

P 3 (ap ..., a» y ) in E by A,(t 1( ..., t n . f ). 

The name form variables (i.e. the a’s) do not appear as such in E and 
E*. In asking whether a given formula E* does come from another given 
formula E by substitution for certain predicate letters, it suffices to ask 
the question taking as the name form variables ones not occurring in 
E and E*. though we are not restricting ourselves to such a choice of the 
name form variables when others will do also. 

The substitution is said to be free, if for each j (j — 1, ..., m), 
A 3 (ap ..., a nj ) is ‘free’ for P i (a 1 , ..a TCy ) in E, in the following sense: 
A(ap ...,a n ) is free for P(a 1( ...,a n ) in E, if, for each occurrence 
P(tp ..., t„) of P(ap .... a n ) in E, (Al) tp ..., t n are free for ap • •. i a„, 
respectively, in A(a x , ..., a n ), and (A 2 ) P(tp ..., t„) does not stand in 
E within the scope of a quantifier Vy or By where y is a free variable of 
A(ap ..., a„) other than one of the name form variables a lf ..., a„. 

Example 3. Let m — 1 ; n — n x = 2 ; ap a 2 be c, d\ P(a x , a 2 ) be 
<3(c, d ); A(ap a 2 ) be V/>B(d, b, c, d)V ^(d, c); and E be 3 cei%(c,a). 
Then E* is 3c(VbB(a, b, c, a) V <HZ(a, c)). The substitution is free. 

It is convenient to refer to the free occurrences of a J( ...,a„ in 
A(a x , ..., a n ) as explicit occurrences; and to other occurrences of variables 
in A(a x , ..., a n ) as anonymous occurrences. Variables occurring ex¬ 
plicitly [anonymously] free (bound) in A(a x , ..., a„) are explicit [ anony¬ 
mous ] free {bound) variables of A(a x , ...,a„). The terminology extends 
generally to situations in which formulas or parts of formulas are being 
represented by metamathematical letters. 
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Example 4. In considering d{a, b) & 3 aS(a, b, c ) as a name form 
in a, b, and in using “A (a, b)" (or “A(a 1( a 2 )” where , ‘a 1 ”, "a 2 ” stand for 
a, b) to stand for it, the first occurrence of a and both occurrences of b 
are explicit, the second and third occurrences of a and the occurrence of 
c. are anonymous. So a and b are explicit free variables, c is an anonymous 
free variable, and a is an anonymous bound variable. In using ,<s 4aA.{a, b)" 
to stand for 'ia(c^(a, b) & 3 a'B(a, b, c)), the first two occurrences of a are 
explicit, the other two anonymous; so a is both an explicit bound variable 
and an anonymous bound variable. The V<a is an explicit quantifier, and 
the 3 a an anonymous quantifier. 

In our further examples of substitution, the variables a lt a 2 , a 3 , ... 
will be a, b, c, .... (This choice can always be made, except when as 
in Example 3 it would interfere with the anonymous variables for the 
substitution.) 

Failure of (Al) or (A2) is always due to the presence of anonymous 
variables. 

Example 5. The formula 3cd7(c, a, b) is not free for i/1(a) in i1(c) 3 S, 
because (Al) is violated (after substitution, with result 3ce2?(c, c, b) 3 S, 
the c of <3\c) would become bound by the anonymous quantifier 3c of 
3c<^{c, a, b)), nor in Vb(<3(a) 3 8(b)), because (A2) is violated (after 
substitution, with result V/>(3cc3?(c, a, b) D S(b)), the anonymous free b of 
3cc27(c, a, b) would become bound by the of 'ib{^{a) 3 B(/>))). 

The conditions (Al) and (A2) can be regarded as conditions that each 
part A(tj, ..., t„) ofE*resulting by the substitution of A(a x , ..., a n ) for 
P(a 1 , ..., a n ) should constitute an occurrence of A(a 1 , ..., a „) as a name 
form in a x , .... a n . 

Example 6 . The a’s in this example shall be a, b, c. But we supply 
indices to assist in referring to different occurrences of the variables. 
Let E be 

(i) V^(^ 2 , a 3 ) & S 3 a(a A , b,)). 

Let A (a, b), B, C (a, b, c) (to be substituted for d{a, b), S, C(a, b, c), 
respectively) be 

(ii) 3 c s C(c v a, b, b), -i S(a 8 ), a, b) 

(the indexed occurrences of variables being anonymous). Then E* (the 
result of the substitution performed on E) is 

(iii) Wq(3c 6 C(c 7 , b 2 , a 3 , a 3 ) & -i S(a B ) 3 3c ti C(c 7 , a 4 , b 5 , b 5 )). 

The substitution is free. 
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The meaning of the next lemma will be made clear by the example 
following it. 

Lemma 16a. If the substitution of (2) for (1) in E is free, then in the 
result E* each free occurrence of a variable originates as a free occurrence of 
the variable either in E or anonymously in some A,(a 1( ..a„ ; ), and each 
bound occurrence of a variable and the quantifier binding it originate together 
in the same relationship either in E or anonymously in some A^(a 1 , ..., a*,). 

Example 6 (concluded). The two free a 3 s in (iii) originate as the 
free a 3 in (i). The free a 8 originates as the anonymous free a 8 in (ii). The 
two b s ’s bound by V/> 1 originate together as the b 5 bound by Yb 1 in (i). 
The c 7 bound by 3c 6 (either such pair) originate as the anonymous c 7 
bound by 3c, in (ii). 

Outline of proof. Consider a given occurrence of a variable in E*. 
This is either (Case 1) not in any of the parts A^tp .... t n? ) (e.g. b^), or 
(Case 2) in one of the parts A y (t lf ..., t» ; ) but not in any t f (e.g. c„ c 7 , a 6 ), 
or (Case 3) in one of the occurrences of t, which is introduced into 
A^tj, .... t n ,) by substitution for a free occurrence of a* in A # (a lf ..., a n/ ) 
(e.g. b 2 , a 3 , a it b 5 ). The lemma follows, using in Case 2 the condition (A2) 
for freedom, and in Case 3 the condition (Al). 

Lemma- 16b. Let 

(2) • • •» ^ij)> • • •> • • • j ^«J 

be formulas respectively congruent to the formulas (2), and let F be a formula 
congruent to F. If the substitution of (2) for (1) in F with result F*, and the 
substitution of ( 2 ) for (1) in F with result ft, are both free, then F f is con¬ 
gruent to F*. 

By Lemma 16a, noting that, if the p -th symbol of F* originates as 
the q-th. of F (of A 7 (a x , .... a n/ )), the p- th symbol of Ft originates as the 
?-th of F (of Ay(a x , ..a n; )). 

Lemma 17. Given a proof of F, and a list of variables z 1( ..., z Q , we 
can find a formula F congruent to F and containing none of the variables 
Zj, ..., z 9 bound, and a proof of F containing no applications of Rule 9 
or 12 with respect to any of z v ..., z 4 . 

Proof. Let the distinct free variables of F be b lf ..., b s , and call 
F also “F^bj, ...,b,)”. Let u 1( ...,u r be all the distinct variables 
occurring free or bound in any formula of the given proof of F (including 
b 1; ..., b„). Let u 1( ..., u r be new variables, distinct from each other 
and from u x , ..., u r , z v ..., z 9 . For the definition of what constitutes 
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a proof in the predicate calculus (Postulate Group A § 19), all variables 
are on a par initially. Hence if, throughout the given proof of F, we change 
Uj, ..., u r simultaneously in all occurrences free and bound to u x , ..., u r , 
respectively, the resulting figure must also be a proof. Say it is a proof of 
P(b 1 , ..., b s ); so (a) b F(bj, • • •. b,). Now by rc-fold substitution (*66), 
(b) F(b!, ..., b s ) b^i ••• b» P(b 1( ..., b,). Referring to the proof of *66, 
since b x , ..., b„ are distinct from b x , ..., b„ the deduction (b) requires 
the use of Rule 9 only with respect to b x , ..b, (and Rule 12 not at all). 
Let F be f^bj, ..., b s ). Combining (a) and (b), we obtain a proof of F in 
which Rules 9 and 12 are used only with respect to the new variables 
u lf ..., u r (including b 2 , .... b,), therefore not with respect to any of 

Zj, . . . , Z q. 

Theorem 15. Substitution for predicate letters. LefD v .. .,D ( , E 
be predicate letter formulas in the distinct predicate letters (1). Let D*,... ,D*, 
E* result by the substitution of (2) (as name forms in the variables shown) for 
(1) throughout D x , ..., D i( E, respectively. Then, provided that 

(A) the substitution is free, and 

(B) for j = 1, ..., m, the anonymous free variables of A 3 (a lf ..., a ny ) 
are held constant in the given deduction for each assumption formula which 
contains the corresponding predicate letter P^a^ ..., a n ,): 

If D x .D, b E, then D*.D* b E*. 

(Proviso (B) is of course satisfied, if all the anonymous free variables 
of (2) are held :onstant in the given deduction. In the case 1 = 0, Proviso 
(B) disappears, and we have simply: If b E, then b E*, provided (A) 
the substitution is free.) 

Proof. We take the given deduction D 1( ..., D, b E as (I) for 
Lemma 8a (§ 24), and pass to a proof (II). The long formula in (II) we 
now write as "F”. The substitution of (2) for (1) in F is free, as we see by 
using Proviso (A), and also Proviso (B) to insure that the condition (A2) 
is met respecting the y’s of (II). If we can show now that b F*, then by 
using Lemma 8a in the converse direction, it will follow that D*,.. . ,D* 
b E*, as is to be proved. 

Accordingly, consider a given proof of F. In formulas of this proof 
there may occur some other predicate letters than (1). Let (F) be the list 
(1) increased to include these; and let (2') be (2) increased correspondingly, 
using as the additional name forms formulas which contain no anonymous 
variables. 

Suppose we were to substitute (2') for (1') throughout the given proof 
of F. Then we could reason, exactly as in the proof of the substitution 
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rule for proposition letters (Theorem 3 § 25), that the resulting figure is a 
proof of F*, except for two contingencies. 

First, the C for an application of Rule 9 or 12 may be transformed by 
the substitution into a formula C* which contains the x of the application 
free, so that the rule no longer applies. This can happen only if the x of 
the application is one of the anonymous free variables z v . .z Q of (2). 
Using Lemma 17, we can replace the given proof of F by a proof of a 
formula F congruent to F and containing none of z lt . . ., z a bound, so 
that in the new proof there are no applications of Rules 9 and 12 with 
respect to z v ..., z a . 

Second, the t for an application of Axiom Schema 10 or 11 may after 
the substitution no longer be free for the x in the A(x), due to the in¬ 
troduction of anonymous quantifiers in the formulas (2) with variables 
occurring in the t. Let us choose formulas (2') congruent to (2') and 
containing bound no variables occurring either free or bound in any for¬ 
mula of the proof of F. 

Now if we substitute (2') for (1') in the proof of F, neither contingency 
can arise; and so, denoting the substitution of (2') by “t”, the resulting 
sequence of formulas will be a proof of FL Thus b f‘ t . 

By the choice of the bound variables in (2'). condition (Al) for freedom 
is satisfied in the substitution of (2) for (1) in F. Because F does not 
contain bound any of the anonymous free variables z v ...,z Q of (2), 
hence of (2), condition (A2) is satisfied also. Thus the substitution of (2) 
for (1) in F is free. So is that of (2) for (1) in F (as remarked above). Hence 
by Lemma 16b, F + is congruent to F*; and by Lemma 15b, b F + ~ F*. 
This with b (and *18a) gives b F*, as remained to be shown. 

Example 7°. We shall prove (Example 2 § 35) that 

(a) b ^?VVaS(d) ~ V S(a)). Thence by Lemma 15b, 

(b) b V VxS(x) ~ Vx(<£? V S(x)), where x is any variable. Thence 
by Theorem 15, (c) b A VVxB(x) ~Vx(AVB(x)), where B(x) is any 
formula, and A any formula not containing x free (as otherwise (A2) for 
Proviso (A) would be violated). This is *92 with the same stipulations as 
appear in Theorem 17. From (b) we easily infer (d) <^7 V VxS(x) b VS(x) 
and (e) J7 V S(x) b x ^7VVxS(x). Substitution for cC7 of a formula A 
containing x free is permissible in (d), but Proviso (B) prevents it in (e). 

The situations which the provisos of the substitution rule, or the 
stipulations on our metamathematical letters, prevent always involve a 
variable occurring both anonymously and explicitly. A blanket rule that 
could be used, in place of the more detailed conditions stated from case 
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to case, is simply that the anonymous variables be distinct from the 
explicit variables. This is of course a little more restrictive than is nec¬ 
essary, e.g. anonymous bound x’s in the A of *92 are clearly innocuous. 

Remark 1 . If E, E are predicate letter formulas in the predicate letters 
(1), and T (- E, then there is a deduction of E from T in [each formula of) 
which no predicate letters other than (1) occur. For in the given deduction 
we can substitute (2') for (1'), where (2) is the same as (1), and the ad¬ 
ditional name forms in (2') contain only (1), e.g. each of them can be 
Vxj ... Vx„ j P 1 (x 1 , ..., x ni ). 

Converse of substitution. A formula A(a p ..., a„) will be called a 
prime name form (in the distinct variables a 1( ..., a n ), provided (i) it has 
none of the forms A D B, A & B, A V B, -iA, VxA(x) or 3xA(x), where 
A and B are formulas, x is a variable, and A(x) is a formula, and (ii) it 
contains exactly the variables a x , .... a n . Two prime name forms 
A^aj, ..., a H< ) and A y (a x , .... a n< ) will be said to be distinct (as prime 
name forms), if A^tj, .... t n< ) and A^Uj, .... u n< ) are not the same 
formula for any terms t r , ..., t„ ( , u x , ..., u fl< . For example, a-\-a=b and 
0 =a-b are distinct, but a-\-a=b and a=b-c are not distinct. 

Theorem 16. Converse of substitution for predicate letters. 
Under the same stipulations as in Theorem 15, without Provisos (A) and (B), 
but provided instead that (2) be distinct prime name forms'. If D*, ..., D* 
I- E*, then D x , ..., D, f- E. 

This can be proved using the same ideas as Theorem 4 (§ 25), and like¬ 
wise admits a second version. 

Name form replacement. Using the notion of name form oc¬ 
currence indicated above (following Example 5), the replacement theory 
(Theorem 14 with *66, cf. §33 following Corollary 2) can be formulated 

thus: A(x 1( ..., x n ) ~ B (x v ..., x„) b Xl "' x " C A (tj.t n > ~ CB(tj.t„>, 

provided the parts A(t p ..., t„) and B^, ..., t„) constitute occurrences 
of A(x 1( ...,x n ) and B(x 1 , ..., x„), respectively, as name forms in 
Xj, ..., x n . 

Remark 2 . Results similar to Theorems 15 and 16 and Remark 1 
hold for individual variables. We shall state only the following, (a) If 
z v .. z Q are distinct variables not occurring bound in D(z 1; ..., z Q ) and 
E(z v ..., z 9 ), and D(z 1( ..., z Q ) h E(z 1 , ..., z q ) with z v ...,z Q held 
constant, then there is a deduction of E{z v ...,z,) from D(z 1 , ...,z s ) 
in which z v ..., z q do not occur bound. Proof, stated for q — 1 and with 
just one variable y varied. By Lemma 8a, V-introd. (on z), and change 
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to a new variable z, Vz[VyD(z) 3 E(z)] is provable. This formula does not 
contain z at all. For the definition of what constitutes a proof in the predicate 
calculus of a given formula, all variables not in the formula are on a par. 
Therefore there is a proof of Vz[VyD(z) 3 E(z)] not containing z. From this 
formula and D(z) we can deduce E(z) by V-elirn. (on z), V-introd. (on y) 
and 3-elim. (b) Let z v ... ,z q be distinct variables not occurring bound in 
D(z 1 , ..., z„) and E(z 1( . . ., z Q ) ; and let t x , .. ., t, be distinct prime terms 
(i.e. individual symbols or variables) none of which occurs in D(z x , ..., z q ) 
or E(z x> ...,z q ) unless it is one of z v ...,z q . Then D(z x , ...,z q ) 
b E(z x , . . ., z q ) with z v .. ., z q held constant, if and only if D(t x , ..., t a ) 
(- E(tj, ..., t 0 ) with ( the variables among) t x , ..., t„ held constant. For by 
(a), z 1( ..., z q , t x , ..., t 7 can be eliminated as bound variables from the 
given deduction, after which every inference will remain valid on sub¬ 
stituting throughout t 1( . . ., t q for z x , . .., z q , or vice versa. 


§ 35. Equivalences, duality, prenex form. Theorem 17. If x and y 

are distinct variables, A, B, A(x), B(x) and A(x, y) are formulas, A and B 
do not contain x free, and for *79 and *80 if x is free for y in A(x, y), then : 


*75. bVxA~A. *76. b3xA~A. 

*77. b VxVyA(x, y) ~ VyVxA(x, y). *78. b 3x3yA(x, y) ~ 3y3xA(x, y). 
*79. b VxVyA(x, y) 3 VxA(x,x). *80. b 3xA(x, x) 3 3x3yA(x, y). 
*81. b VxA(x) 3 3xA(x). 

*82. b 3xVyA(x, y) 3 Vy3xA(x, y). 

(Alterations of quantifiers.) 


*83°. b 3xA(x)-iVx-iA(x). *84°. b VxA(x)- i3x-iA(x). 

(Each of 3 and V in terms of the other and -i.) 


*85°. b -iVxA(x) ~ 3x-iA(x). 
*87. b VxA(x) & VxB(x) ~ 
Vx(A(x) & B(x)). 
*89. b A & VxB(x) ~ 

Vx(A & B(x)). 

*91. b A & 3xB(x) ~ 

3x(A & B(x)). 

*93. b 3x(A(x; & B(x)) 3 
3xA(x) & 3xB(x). 


*86. b ~i3xA(x) ~ Vx-iA(x). 
*88. b 3xA(x) V 3xB(x) 

3x(A(x) V B(x)). 

*90. b A V 3xB(x) ~ 

3x(A V B(x)). 

*92°. b A V VxB(x) ~ 

Vx(A V B(x)). 

*94. b VxA(x) V VxB(x) 3 
Vx(A(x) V B(x)). 


(Transfer of -i, & and V across quantifiers). 
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*83a. h 3xA(x) D-iVx-iA(x). *84a. 1- VxA(x) 3 -i3x-iA(x). 

*85a. h 3x-iA(x) 3 -iVxA(x). *92a. 1- AVVxB(x) 3 Vx(A VB(x)). 

(Additional results of interest for the intuitionistic system.) 

*95. b Vx(A 3 B(x)) ~ A 3 VxB(x). 

*96. b Vx(A(x) dB)~ 3xA(x) 3 B. 

*97°. b 3x(A 3 B(x)) ~ A 3 3xB(x). 

*98°. b 3x(A(x) 3 B) ~ VxA(x) 3 B. 

*99°. b 3x(A(x) 3 B(x)) ~ VxA(x) 3 3xB(x). 

*97a. b 3x(A 3 B(x)) 3 (A 3 3xB(x)). 

*98a. b 3x(A(x) 3 B) 3 (VxA(x) 3 B). 

*99a. b 3x(A(x) 3 B(x)) 3 (VxA(x) 3 3xB(x)). 

(Transfer of quantifiers across 3, with comparison of classical and 

intuitionistic results.) 

Proofs, for the classical system, of *75—*94, excepting *76, *78, 
*80, *88, *90, *92, *94. Work is saved by postponing these seven until we 
have duality (or for *80 and *94, the dual-converse relationship). Then, 
classically, *95—*99 will follow by using *59 (§27) with *88, *90, *92 and 
*85, *86. 

*75 If we redesignate A as “A(x)”, then since A does not contain 
x free, A(t) is also A(x) (§ 18). The result follows, using V-elim. and 
3-introd. (or Axiom Schema 10), and V-introd. and 3-introd. [x is 
not varied in the V-introd., since A(x) does not contain it free], and *16. 
*79. 1. VxVyA(x, y) b A(x, x) b x VxA(x, x) — double V-elim. 

(*65) [x, x is a pair of terms free for x, y in A(x, y)], V-introd. 

2. b VxVyA(x, y) 3 VxA(x, x) — 3-introd., 1 [x is not varied 
in 1, since VxVyA(x, y) does not contain x free]. 

*82. 1. A(x, y) b 3xA(x, y) b y Vy3xA(x, y)— 3-introd., V-introd. 

2. VyA(x, y) b Vy3xA(x, y)—V-elim., 1. 

3. 3xVyA(x, y) b Vy3xA(x, y) — 3-elim., 2 [Vy3xA(x, y) does not 
contain x free, and no variable is varied in 2, since y (cf. 1) 
does not occur free in VvA(x, y)]. 

4. b 3xVyA(x, y) 3 Vy3xA(x, y) — 3-introd., 3 [no variable 
is varied in 3 (cf. Lemma 7b § 24)]. 

Note how the attempt to give a corresponding demonstration of the con¬ 
verse of *82 (i.e. of *82 with the direction of 3 reversed) is defeated by 
the restriction on the use of subsidiary deduction in the predicate calculus: 
1. A(x, y) b y VyA(x, y) b 3xVyA(x, y) — V-introd., 3-introd. 
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2? 3xA(x, y) )- 3xVyA(x, y) — 3-elim., 1. But this is illegitimate, 
since the rule of 3-elimination (in contrast to the V-elimination rule used 
at Step 2 for *82) is a subsidiary deduction rule, and is inapplicable here 
because in the subsidiary deduction 1 the variable y is varied for the 
assumption formula A(x, y) to be discharged (except if A(x, y) does not 
contain y free). 

There is no way around this difficulty, and the converse of *82 should 
not be provable for arbitrary A(x, y). In terms of the interpretation, the 
formula 3xVyA(x, y) says that there is one x such that for every y, 
A(x, y); and Vy3xA(x, y) says merely that for every y there is some x, 
not necessarily the same x for different y’s, such that A(x, y). The dis¬ 
tinction is familiar to mathematicians from the example of uniform con¬ 
vergence vs. ordinary convergence of a sequence of functions a n (x) to a 
limit function a(x) on an interval or other range X of x. Using the present 
logical symbolism, and variables p, n and N ranging over natural numbers, 
and x over X, the properties of uniform convergence and ordinary con¬ 
vergence are expressed respectively by 

(i) V^>3.ZVV%V«(tt > N 3 | a n (x) — a(x) | < 1/2 P ), 

(ii) V^Vv3AVw(n > JV D | a n (x) ■—«(%) | < 1/2 P ). 

Then *82 says that uniform convergence implies ordinary convergence; 
but the converse is not generally true. (Similarly for uniform and ordinary 
continuity.) 

A metamathematical demonstration that Vi>3a<37(<2, b) 3 la'ib^ia, b) 
is unprovable in the predicate calculus will be given in Example 2 § 36. 

*83. 1. A(x), Vx-i A(x) |- -iA(x)— V-elim. 

2. A(x), Vx-iA(x) b A(x). 

3. A(x) b -i Vx-iA(x) — -i-introd., 1,2 [no variable of Vx-iA(x) 
is varied in 1 or 2]. 

4. 3xA(x) b -iVx-iA(x) — 3-elim., 3 [-iVx-iA(x) does not con¬ 
tain x free, and no variable is varied in 3]. 

5. b 3xA(x) 3 -i Vx-iA(x) — 3-introd., 4 [no variable is varied 
in 4]. 

6. -i3xA(x), A(x) b 3xA(x) — 3-introd. 

7. -i3xA(x), A(x) b ”i3xA(x). 

8. -i3xA(x) b ~iA(x) b x Vx-iA(x) — -i-introd., 6, 7 [no var¬ 
iables are varied in 6, 7], V-introd. 

9. b ~i3xA(x) 3 Vx-iA(x) — 3-introd., 8 [x does not occur free 
in the assumption formula -i3xA(x) of 8 which is being dis¬ 
charged]. 
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10. 1- -i Vx ~»A(x) 3 3xA(x) —contraposition (*14), 9. 

11. b 3xA(x) ~ -iVx-iA(x) — &-introd. (*16), 5, 10. 

*84. See Example 1 §34. 

*87. 1. A(x), B(x) b A(x) & B(x) b x Vx(A(x) & B(x)) — &-introd., 
V-introd. 

2. VxA(x), VxB(x) b Vx(A(x) & B(x)) — V-elim. twice, 1. 

3. VxA(x) & VxB(x) b Vx(A(x) & B(x)) — &-elim„ 2. 

4. b VxA(x) & VxB(x) 3 Vx(A(x) & B(x)) — 3-introd., 3 [no var¬ 
iable is varied in 3, since x (cf. 1) does not occur free in 
VxA(x) & VxB(x)]. 

5. A(x) &B(x) b A(x) b x VxA(x) — &-elim., V-introd. 

6. A(x) & B(x) b B(x) b x VxB(x) — &-elim., V-introd. 

7. A(x) & B(x) b x VxA(x) & VxB(x) — &-introd., 5, 6. 

8. Vx(A(x) & B(x)) b VxA(x) & VxB(x) — V-elim., 7. 

9. b Vx(A(x) & B(x)) 3 VxA(x) & VxB(x) — 3-introd., 8 [no var¬ 
iable is varied in 8, since x (cf. 7) does not occur free in 
Vx(A(x) & B(x))]. 

10. b VxA(x) & VxB(x) ~ Vx(A(x) & B(x)) — &-introd., 4, 9. 

*89, *91', *93. If we read “A” for “A(x)” and '‘VxA(x)” in the pre¬ 
ceding, omitting one V-elimination at Step 2 and the V-introduction at 
Step 5, it reads as a demonstration of *89. Then substituting 3 for V 
throughout, we get a demonstration of *91. The reader may write this out, 
and verify that the conditions for the 3-eliminations are satisfied. But 
substitution of 3 for V in the proof of *87 does not work. Why? We thus 
obtain only *93 but not the converse. 

As a further exercise, the reader may attempt to give a corresponding 
proof of *92, and see how this is defeated by the restriction on subsidiary 
deductions. The result *92, which we shall infer from *91 after we have 
duality, is interesting as an example of a formula which does not contain 
-*i but which we do not succeed in proving without Postulate 8. 

In view of *49b §27, there can be at most 18 (= 3-2-3) intuitionistically 
non-equivalent formulas formed from A(x) by quantifying x and possibly 
applying negation. (We use either 0,1 or 2 -Vs first, then either Vx or 3x, 
then again either 0, 1 or 2 -i’s.) 

Corollary. Each of the jour tables I — IV comprises formulas equiv¬ 
alent to one another in the classical predicate calculus. For each table, in 
the intuitionistic system: Each two formulas not separated by a line are equiv¬ 
alent. Each formula implies any formula below it, i.e. the implication from 
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the one to the other is provable. The double negation of the implication 
from each formula to any formula not separated from it by a double line is 
provable (and hence, using *49a and *25, of the equivalence). (Heyting 1946.) 



I 


II 

a. 

VxA(x) 

a. 

3xA(x) 

b. 

-i-iVxA(x) 

b. 

3x n-iA(x) 

Ci. 

Vx —i—1 A(x) 

Ci. 

-i-i3xA(x) 

c 2 . 

-1-1 Vx -i-iA(x) 

c 2 . 

-i-i 3x -1-1 A(x) 

c 3 . 

-1 3x -1 A(x) 

c 3 . 

-1 Vx -1 A(x) 


III 


IV 

a. 

3x -1 A(x) 


Vx -1 A(x) 

bi- 

“i-i 3x -1 A(x) 

a 2 . 

-1-1 Vx -1 A(x) 

b 2 . 

-1 Vx -1-1 A(x) 

a 3 . 

-1 3x -1—1 A(x) 

c. 

-1 VxA(x) 

a 4 . 

-i3xA(x) 


Proofs for Table II, in the intuitionistic system. b Ila Z)IIb [*49a, *70]. 

bllbDlICg [*85a]. b Hc 3 ~ Ilq [*86]. b IIc 3 -iVxn-nA(x) 

[*49b] ~ IIc 2 [*86]. b —i - «(IIc 2 D lib) [*51b]. Likewise, b _ i _ i(IIc 1 3 
Ila). Hence b —«—1 (lib D Ila), by *24 with b Hb D IIc x and *49a. 

Theorem 18°. Let D be a predicate letter formula constructed from the 
distinct predicate letters P 1 (a 1 , .... a n ),..., P m (a 1 , a*,,) and their 
negations ~iP 1 (a 1 , ..., a ni ), ..., -iP m (a 1; ..., a n J using only the opera¬ 
tors Sc, V, Vx and 3x (for any variable x). Then a formula D+ equivalent to 
the negation -1D of D is obtained by the interchange throughout D of Sc with 
V, of V with 3, and of each letter with its negation. 

In other words, if D be such a predicate letter formula, and Dt be the 
result of the described interchange performed on D: b ~iD ~ D+. 

Example 1°. b “■ 3a (Vi> -\Td(b) Sc (-iS V 3c C(a,c,b)))~ 

Va{lb ZI(b) V ( £&Vc-iC(d,c, b))). 

Proof is by the same method as Theorem 8 (§ 27), using *85 and *86 
to handle the two new cases which now arise under the induction step. 
The theorem as before admits a second version. 

Corollary 0 . An equivalence between two letter formulas E and F of the 
type described in the theorem is preserved under the interchange throughout 
E and F of Sc with V and of V with 3. 
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In other words, if E and F be two such predicate letter formulas, and E' 
and F' be the results of the described interchange performed on E and F, 
respectively. If 1- E ~ F, then (- E' ~ F'. (Principle of duality.) Also, if 
|- E D F, then (- F' D E'. (Dual-converse relationship.) 

The corollary follows from the theorem as before. 

Example 2°. By *91, f- <77 & 3aB{a) ~ 3<z(<37 & B{a)). Thence by 
duality, \- <77 V VaB(a) ~ V# (<77 V B(a)). Thence *92, as in Example 7 § 34. 

Similarly, we obtain *76, *78, *88, *90 as duals of *75, *77, *87, *89 
and *80, *94 as dual-converses of *79, *93, respectively. (Note that 
each of *81 and *82 is self-dual-converse.) 

Theorem 19°. Given any formula C, there can be found a formula D 
(called a prenex form of C) with the following two properties. The formula 
C is equivalent to D, i.e. |- C ~ D. In D, all the quantifiers {if any ) stand 
at the front, i.e. all the other logical symbols D, &, V, —t {if any) stand 
within the scope of every quantifier (such a formula we say is prenex). 

Proof. To reduce a formula C to prenex form, we can, step by step, 
move all the quantifiers outside the scopes of the logical symbols Z>, &, 
V,-i by applications of *85, *86, *89—*92, *95—*98, noting the following. 
In case the formula for application of *89—*92, *95—*98 fails to meet the 
condition that the A or the B not contain the x free, a change of bound 
variables can be made by *73 or *74. In case the A for *89—*92 stands on 
the wrong side of the & or V, *33 or *34 can be used. (This procedure for 
reduction to prenex form does not require *87, *88 or *99; but at any 
point where one of them can be applied, the use of it will save steps and 
lead to a shorter prenex form.) 

To prove that the procedure terminates, we can use as induction number 
the number of instances of a quantifier standing inside the scope of an 
D, &, V or -i, i.e. the total number of pairs, one member of which is an 
D, &, V or -i, and the other member is a quantifier standing inside 
the scope of that member. If this number is not 0, there must be some 
instance where there is no logical symbol of intermediate scope. A step 
is then carried out which removes this instance, leaving the others un¬ 
changed, so that the induction number is reduced by one. (If *87, *88 or 
*99 is used, the induction number is reduced by two or more.) 

Example 3°. The numbered formulas which follow are successively 
equivalent, each to the next, by replacements based on the equivalences 
cited at the right. The last formula is a prenex form of the first, or indeed 
of any formula in the list. 
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1. [-1 3 a d{d) V V<z£(tf)] & [C => VaT>(a)]. 

2. [Vtf -i<3(*) V VaS(tf)] & Va[C D ©(a)] — *86, *95. 

3. Va[Va nd(a) V S(a)] & Va[C D ©(a)] — *92. 

4. Va[Vi> —i(v?(/>) V S(a)] & Va[C D ©(a)] — *73. 

5. Va V/>[-i e2?(/>) V S(a)] & Va[C D ©(a)] — *34, *92. 

6. Va{V/.[-.^) VS(a)] & [C D ©(a)]} — *87. 

7. VaV/>{[-i<v7(/>) V ®(a)] & [C D ©(a)]} — *33, *89. 

§ 36. Valuation, consistency. The predicate calculus is intended 
to be a formalization of principles of predicate logic which hold good 
independently of the number of elements in the object domain, provided 
there is at least one element (§31). Hence the provable formulas should 
all be true, if we specify the number to be k, where k is any integer ;> 1. 
We may combine this idea with the one used in § 28, where we abstracted 
from true and false propositions to obtain two arithmetic objects t and f. 
This suggests a finitary valuation procedure, which will enable us to 
establish metamathematically the consistency of the predicate calculus. 

In the valuation procedure, a predicate letter formula is considered 
as representing a function of the free individual variables and of the 
predicate letters (including proposition letters) which it contains, or pos¬ 
sibly of these and other free variables and predicate letters as well. Then 
when we define the notion which will correspond to that of identical 
truth for the propositional calculus, we shall require that the formulas 
provable in the calculus should be true whatever predicate each predicate 
letter represents, and also, in view of the generality interpretation of the 
free individual variables (§ 32), whatever object from the object domain 
each free individual variable represents. 

After choosing a fixed positive integer k for the number of distinct 
objects in the object domain, it does not matter for the valuation proce¬ 
dure what the objects themselves are. It is convenient to take them to be 
(or to call them) the numbers 1, ..., k. These numbers are to be the values 
which the individual variables take. 

As before (§ 28), the proposition letters (i.e. letters for predicates of 0 
variables) take the values t, f. 

Now consider the predicate letters. For any given integer n > 0, a 
predicate letter with n attached variables differs under the logical inter¬ 
pretation df the system from a proposition letter in that it represents not 
a proposition but rather a propositional function of n variables, i.e. a 
function which takes a proposition as value for each set of values of the 
attached variables (§31). The values which we give a predicate letter with n 
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attached variables in our valuation procedure, when the propositions 
are replaced by t and f, will accordingly be not t, f, but rather functions 
of n variables each over the domain {1, ...,&} taking values in the domain 
{t, f}. There are exactly 2 kn different such functions. We call them the 
logical functions of n variables over the domain of k objects. The truth 
values t, f, which we use for the case n = 0, can be considered as being 
the 2 (= 2 fc0 ) logical functions of 0 variables. 

As before, we interpret D, &, V, -i as fixed functions over the domain 
of two objects {t, f} taking values in the same domain, defined by the 
tables given previously (§ 28). We now interpret V and 3 as fixed functions 
over the logical functions of one variable taking values from the domain 
{t, f}, where the variable x in Vx or 3x indicates of what individual 
variable the operand shall be considered as logical function. These two 
fixed logical functions are defined as follows. For a given logical function 
A(x), the value of VxA(x) is t, if A(x) has the value t for every value of x 
in the domain {1 ; otherwise the value is f. The value of 3xA(x) 

is t, if A(x) has the value t for some value of x in the domain {1, 
and otherwise the value is f. 

Given a predicate letter formula, we are now in a position to compute 
a table expressing the values of the function, of the distinct free in¬ 
dividual variables and predicate letters occurring in it (or of these and 
other variables and letters), which the formula represents. For the tab¬ 
ulation, it will be convenient first to list in some fixed order the logical 
functions which will be required as values of the predicate letters, and to 
introduce symbols to stand for them. 

Example 1 . For k — 2, let us construct the table for the predicate 
letter formula Vd(S 3 <3?(d)) V (-i d(b) & 2), for exactly the free vari¬ 
ables and predicate letters contained in it. This formula then represents 
a function of three variables, namely b, S and <3f( a ), where a is the attached 
variable of the name form used for <3 as a predicate letter with one 
variable (cf. § 31). Before constructing the table for the formula, we shall 
introduce notations for the logical functions of one variable, which we 
shall be employing as values of the variable (a). Since k — 2, there 
are 4 (= 2 21 ) of them, I x (a), I 2 (<j), I z {a), \ 4 (d), defined by the following 
table of their values. 
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Logical functions of one variable in a domain of two objects 


Value of the inde- Corresponding value of the respective 

pendent variable function 


a 

ti (a) 

!.(*) 

h(a) 


i 

t 

t 

f 

f 

2 

t 

f 

t 

f 


The table for the given formula is now as shown (the computation of a 
sample entry will follow). 


Value of the respective Corresponding value of 

independent variable the function 


b 

B 

Jl{a) 

Va(BD4))V(i^)&21) 

i 

t 

m 

t 

i 

t 

1.(4) 

f 

i 

t 

1.(4) 

t 

i 

t 

W«) 

t 

i 

f 

*l(4) 

t 

i 

f 

1.(4) 

t 

i 

f 

I S (4) 

t 

i 

f 

*4(4) 

t 

2 

t 

*l(4) 

t 

2 

t 

1.(4) 

t 

2 

t 

1.(4) 

f 

2 

t 

1*(4) 

t 

2 

f 

ll(4) 

t 

2 

f 

1.(4) 

t 

2 

f 

1.(4) 

t 

2 

f 

1*(4) 

t 


We now give the computation for the second value (line 2 of the table). 
For this purpose, when we come to the V-operation, we shall need the 
table for B D d(a) with B having the value t and <3(a) the value 1 2 {a). 
We give this table first (its computation to follow). 
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Value of the inde¬ 

Corresponding value 

pendent variable 

of the function 

a 

t=>t 2 (a) 

1 

t 

2 

f 


The computations of the two values in this subsidiary table, using the 
table for 1 2 (a) given at the outset and the table for D from § 28, are these: 

t 3 1 M) t D U{a) 

t 3 1,(1) tDl 2 (2) 

t D t t D f 

t f 

We now return to the original question of computing the entry for the 
second line of the table for V<z(3 D <^(a)) V (-i £l(b) & 3), i.e. the value 
of this formula when b has the value 1, 3 the value t and Zl(a) the value 
l 2 (a). We start out thus. 

Va(2 D c3(<*)) V(-ic3(£)&2) 

V<*(t D 1 2 (a)) V (-i I 2 (l) & t) 

Since the value column of the table for t D \ 2 {a) does not have all t’s, 
we can replace Va (t D I 2 (<j)) by f under the interpretation given for V, 
and also we can replace 1 2 (1) by its value t from the table for l 2 (a), thus. 

f V (-11 & t) 

f v (f & t) 

fV f 

f 

The reader may verify that he understands the procedure by calculating 
other entries in our table for Vu(3 D <37(<j)) V (-ic 1(b) & 2). Of course 
there are usually short cuts that can be used in ascertaining the table 
for a formula, so that it is not necessary to go through the whole calculation 
procedure separately for every entry of the table (e.g. here we can rec¬ 
ognize at once that all eight entries for which 3 is f are t). But this cal¬ 
culation illustrates the underlying definition of the function represented 
by a predicate letter formula for a given positive integer k. 

If, for a given k, the value column of the table for a predicate letter 
formula, as a function of certain free variables and predicate letters 
including all contained in it, contains only t’s, we say that the formula 
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is k-identical, or valid in a domain of k objects. If it contains some t’s, 
the formula is said to be satisfiable in a domain of k objects. 

As before (§ 28), if it is ^-identical (or satisfiable in a domain of k 
objects) for the minimal list of free variables and predicate letters, then 
it is for any other list; and conversely. Hence again the reference to the 
list can be omitted. 

If the value columns of the tables for two predicate letter formulas 
considered as functions of just those free variables and predicate letters 
contained in either (or of any others as well) are the same, the two 
formulas are said to be k-equal. 

Theorem 20. For each fixed integer k (k > 1): A necessary condition 
that a predicate letter formula E be provable [or deducible from k-identical 
formulas T) in the predicate calculus is that E be k-identical. 

Proof. The theorem follows from two lemmas, corresponding to 
those for Theorem 9, but now referring to the postulate list of the pred¬ 
icate calculus, predicate letter formulas and ^-identity. The reasoning 
already given for the postulates of Group A1 carries over in its essentials 
to the present situation, and for the four postulates of Group A2 we show 
the treatment of two. 

Axiom Schema 10. An axiom by this schema is VxA(x) D A(t), 
where the term t for the pure predicate calculus is simply a variable, 
and this variable t is free for x in A(x). 

The variable t may be the same or distinct from x, and x may or may 
not occur free in A(x). 

We must show that VxA(x) D A(t) takes the value t, for every as¬ 
signment of logical functions as values of the predicate letters, and of 
the objects 1 , ...,k as values of the free variables, contained in 
VxA(x) D A(t). 

Consider a particular such assignment. If the assigned values are given 
to the predicate letters and free variables of A(x), except (if x occurs free 
in A(x)) to the variable x, then A(x) represents a logical function of the 
variable x (whether or not x occurs free in A(x)). 

Since t is free for x in A(x), in the sequence of symbols A(t) the free 
occurrences of t occupy the positions which are occupied in A(x) by free 
occurrences, of either t or x. Therefore the value of the logical function 
represented by A(x), when x takes as value the assigned value of t, is the 
value of A(t) for the particular assignment under consideration. 

Now there are two cases. 
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Case 1: the logical function represented by A(x) has only t's as values. 
Then the value of A(t'). which is one of these values, is t. Hence, by the 
valuation table for 3 (§ 28), VxA(x) 3 A(t) has the value t. 

Case 2: the logical function represented by A(x) has some f’s in its 
value column. Then, by the definition of the valuation process for V, 
the formula VxA(x) has the value f; and again VxA(x) 3 A(t) has the 
value t. 

Rule 12. The premise is A(x) 3 C, and the conclusion is 3xA(x) 3 C, 
where C does not contain x free. 

We have as hypothesis that A(x) 3 C takes the value t for every 
assignment of logical functions as values of the predicate letters, and of 
the objects 1, ..., k as values of the free variables, contained in A(x) 3 C. 
We must show the like for 3xA(x) 3 C. 

Consider a particular assignment for 3xA(x) 3 C, for just the predicate 
letters and free variables which it contains. Since 3xA(x) 3 C does not 
contain x free, this does not include an assignment for x. 

Now if the assigned values are given to the predicate letters and free 
variables of A(x), except (if x occurs free in A(x)) to x, then A(x) repre¬ 
sents a logical function of the variable x. 

Case 1: the logical function represented by A(x) has some t’s in its 
value column. Choose a value of x corresponding to one of these t’s; and 
consider the given assignment for 3xA(x) 3 C, together with this value 
of x, as an assignment for A(x) 3 C. Then A(x) has the value t; by 
hypothesis, A(x) 3 C has the value t; and hence by the valuation table 
for 3, C has the value t. Since C does not contain x free, this value of 
C is on the basis of the given assignment for 3xA(x) 3 C, without regard to 
the value of x. Hence, by the valuation table for 3, 3xA(x) 3 C takes 
the value t for the given assignment. 

Case 2: the logical function represented by A(x) has only f’s as values. 
Then 3xA(x) has the value f; and hence 3xA(x) 3 C has the value t. 

Example 1 (concluded). Vu(S 3 <^b{a)) V (-i^(b) & S) is not k- 
identical, since its value column contains f’s, and hence it is unprovable 
in the predicate calculus. 

Example 2. For k = 2, when £l(a, b ) takes as value the logical 
function I (a, b) such that 1(1, 1) = 1(2, 2) — t, 1(1, 2) = 1(2, 1) = f, the 
formula 'ib3a^l{a, b) 3 3aVb<3(a, b) takes the value f, so it is unprovable. 
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Corollary 1 . For each integer k > 1 , a necessary condition that two 
Predicate letter formulas E and F be equivalent is that they be k-equal, i.e. 
if 1 - E ~ F, then E and F are k-equal. 

Example 3. For k — 2, with X^a), l 2 ( a )> h( a )> ^i( a ) as in Example 1 , 
we have the following tables for the top formulas of I — IV Corollary 
Theorem 17 written with a particular variable and predicate letter. 
Hence no two of the four are equivalent. 


Tl{a) 

VaFl{a) 

3a^{a) 

3a -it Ti{a) 

V# -i c (a) 

h(a) 

t 

t 

f 

f 

w 

f 

t 

t 

f 

W) 

f 

t 

t 

f 

U{a) 

f 

f 

t 

t 


Corollary 2. The predicate calculus is {simply) consistent, i.e. for no 
formula A, both 1 - A and |- -iA. 

Proof. If A is a predicate letter formula, then, for each k, A and -i A 
cannot both be ^-identical. (Here it would suffice to have proved the 
theorem for any one fixed k. In the original proof by Hilbert and Acker- 
mann 1928 , k = 1.) For the other sense of formula, the consistency follows 
by the converse of the substitution rule (Theorem 16 §34). 

*§ 37. Set-theoretic predicate logic, ^-transforms. The logical 
functions used in the valuation procedure for a given finite k are finite 
objects, in the sense that each is represented by a table having a finite 
number of entries. 

The notion of a logical function of n variables can be stated similarly 
for the case that the object domain has an infinite number of elements, 
but with the difference that the functions cannot be described by 
finite tables. We can then define the notions of validity, and of sat¬ 
isfiability, in a given non-empty object domain. Clearly only the cardinal 
number of the object domain matters for this purpose, and not what the 
elements are themselves. 

Then by reasoning as in the proof of Theorem 20 , we can show that 
every provable predicate letter formula is valid in every non-empty 
object.domain. In demonstrating this for the case of an infinite domain, 
the reasoning employed is no longer finitary. A non-finitary step appears, 
for example, in the treatment of Axiom Schema 10, where we distinguish 
two cases according as all of the values of a certain function are t’s or 
some are f’s, there being now infinitely many values under consideration. 
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This constitutes an application of the law of the excluded middle for an 
infinite set (§ 13). In fact the notion itself of validity, for the case of 
an infinite domain and a formula containing a predicate letter with n > 0 
attached variables, is not finitary. For it requires that the value of a 
function be t for all logical functions of n variables as values of that 
predicate letter; and the class of those logical functions is non-enumerable, 
and so only conceivable (as we usually think) in terms of the completed 
infinite. 

The result that every provable letter formula of the predicate calculus 
is valid in every non-empty object domain therefore does not belong to 
metamathematics (cf. § 15). It belongs rather to what may be called 
set-theoretic predicate logic (Hilbert-Bernays 1934 p. 125), which has in 
common with metamathematics that it makes the logical formalism an 
object of study, but differs from it in not being restricted in the study to 
finitary methods. While our main business is metamathematics, the extra- 
metamathematical conceptions and results of set-theoretic predicate logic 
may have heuristic value, i.e. they may suggest to us what we may hope 
to discover in the metamathematics. 

Our success in proving the consistency of the predicate calculus 
(Theorem 20 Corollary 2) stems from the fact that we were able to fit 
to the formulas of the calculus a finitary interpretation, namely the 
interpretation by validity in a fixed finite domain (i.e. ^-identity). 
But that does not correspond to the usual interpretation of the calculus, 
to which corresponds rather validity in an arbitrary non-empty 
domain. 

On the basis of the set-theoretic result that every provable formula 
is valid in every non-empty domain, we shall now see heuristically that 
the totality of the necessary conditions given by Theorem 20, i.e. the 
property of being ^-identical for every finite k, is insufficient for prova¬ 
bility. 

This will be a consequence of the fact that a formal axiom system 
(cf. § 8) about a set D of elements which requires D to be infinite can 
be expressed by a predicate letter formula. To convey the idea, we shall 
first give the axioms as three formulas in our logical symbolism, but using the 
predicate symbol < to suggest that they are axioms for an order relation: 

-ia<a, a<b & b<c 3 a<c, 3 b(a<b). 

These are order properties which are satisfied (in the old intuitive sense, 
§ 8) when the domain of elements is the set of the natural numbers, 
and < is the usual order relation for them. It is easily seen that they 
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cannot be satisfied in any finite non-empty domain of elements. (The 
details are left to the reader.) 

Now let us express the axiom system as one predicate letter formula, 
using a{a, b) in place of a<b, and also using only bound variables: 
Va-.c3(<j, a) & VaV£Vc[c2(a, b) & <Pl(b, c) D c3(fl, c)] & 'ia3b£Z(a, b). 

This formula, call it “F”, is not satisfiable (in the new set-theoretic 
sense) in a domain of k objects, for any finite k > 0, but is satisfiable in 
the enumerable domain of the natural numbers. Its negation -iF is then 
valid in a domain of k objects for every finite k, i.e. is ^-identical for 
every finite k, but is not valid in the enumerable domain. Thus -iF 
meets the totality of necessary conditions of Theorem 20, but by the 
set-theoretic result that every provable formula is valid in every non¬ 
empty domain, it cannot be provable in the predicate calculus. This ex¬ 
ample -iF and others are given in Hilbert-Bernays 1934 pp. 123—124. 

In terms of the set-theoretic interpretation, the completeness of the 
predicate calculus should mean that every predicate letter formula 
which is valid in every non-empty domain should be provable. This 
interpretation is not finitary, unlike the corresponding interpretation 
for the propositional calculus (§§28, 29), and so the corresponding com¬ 
pleteness problem does not belong to metamathematics. These remarks 
suggest that the situation is not as simple as for the propositional calculus, 
when we come to the question of completeness and to the decision problem. 
We shall return to these problems in a later chapter on the predicate 
calculus, where several results will be presented, partly metamathematical 
and partly of the set-theoretic variety (Chapter XIV). 

For later reference, we summarize our present conclusions in a theorem 
and corollary, which we mark with the letter “ c ” to show that they do 
not belong to our sequence of metamathematical theorems, but are only 
established here by use of non-finitary classical methods. Although we 
are using “set-theoretic” for the present kind of predicate logic, some of 
the results are only on the level of classical number-theory; e.g. while 
Theorem 21 in its full generality involves sets of arbitrarily high cardinal 
number, the corollary can be inferred also from a specialization of 
Theorem 21 relating to the enumerable domain of the natural numbers and 
the enumerable class of the logical functions which are expressible by the 
use o'f the logical symbolism applied to < (where a < b is t or f, according 
as a is a lesser natural number than b or not). 

Theorem 21 c . In each non-empty object domain : Every predicate letter 
formula which is provable (or deducible from valid formulas) in the predicate 
calculus is valid. 
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Corollary 0 . In order that a predicate letter formula be provable in the 
predicate calculus, it is not (in general) sufficient that the formula be k- 
identical for every positive integer k. 

Analogy between V, 3 and &, V. When interpreted in a finite domain 
of k objects the formula VxA(x) is synonymous with 

A(l) & ... & A(k), and 3xA(x) with A(l) V ... V A(ft), where 1, ..., k 
are names in the formal system for the objects. This suggests a slightly 
different approach to the results of the preceding section, from which 
we will obtain an intermediate metamathematical result of some interest 
in itself. (For the case k = 2, cf. Hilbert-Ackermann 1928 pp. 66 — 68.)' 

We shall take 1,2, .... It to be the formal expressions O', 0”, ..., 0'' 
(the last having k accents) which we call numerals from 1 to k. (However 
it would serve our purpose equally well to use k individual symbols.) 

We define predicate letter formula with k individuals, or briefly k- 
predicate letter formula, by allowing the terms for Clause 1 of the definition 
of predicate letter formula in § 31 to include now the numerals from 1 to k 
as well as the variables; and the predicate calculus with this notion 
of formula we call the predicate calculus with k individuals, or briefly the 
k-predicate calculus. 

Then we define k-proposition letter formula either as a /^-predicate letter 
formula containing no variables free or bound (and therefore no quanti¬ 
fiers) ; or equivalently by allowing the formulas for Clause 1 of the def¬ 
inition of proposition letter formula (§ 25) to include not only the propo¬ 
sition letters but now also the expressions resulting from the predicate 
letters by substituting numerals from 1 to k for each of their attached 
variables, e.g. for k = 2, <£7(1), <£7(2), 2(1), <£7(1, 1), <£7(1, 2), <£7(2, 1). 

A convenient abbreviation is to write the latter as “<£7/’, "<£7 2 ”, “2/’, 
‘‘<£7n”, “<£7 12 ”, “<£7 21 ”, respectively, so that in effect we merely augment 
the former list of proposition letters by the same alphabetical letters with 
finite numbers of positive integral subscripts <; k. Our former theory for 
the pure propositional calculus obviously will apply unchanged, if now 
each two letters differing either alphabetically or in their subscripts are 
treated as distinct proposition letters. 

Given any closed ^-predicate letter formula, we define its k-transform 
to be the /^-proposition letter formula which results from it by replacing, 
successively, each part of the form VxA(x) where A(x) is a A-predicate 
letter formula by A(l) & ... & A(k), and likewise each part 3xA(x) by 
A(l) V ... V A(ft), until all of the quantifiers are eliminated. It is easily 
seen that the order of the replacements does not affect the result. 
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Example 1. The 2-transform of b) 'D 3aVb<3(a, b) is 

(<77(1, 1) V <77(2, 1)) & (<77(1, 2) VcT(2, 2)) D (<77(1, 1)&<77(1, 2)) V (<77(2, 1) 
& <77(2, 2)), or briefly 

(<ar„ V <77 21 ) & (<77 12 V <77 22 ) D (<3 u & c 3 12 ) V (<ar 21 & <t? 22 ). 

Given a ^-predicate letter formula A(x x , with exactly the 

distinct free variables Xj, ..x„, the set of its k-transforms shall be the 
^-transforms of the k n closed ^-predicate letter formulas obtained from 
A(x x , . ..,x„) by substituting for x lt ...,x n each of the k n different 
w-tuples of numerals from 1 to k. 

Example 2. The 2-transforms of V<j<77(u, c) D <77 (b, c) are 
<77 u & <77 21 —j <77 n , <77 12 5<. <77 22 —o < 77j^ 5c <77 2 ^ c2? 2 ^, <77 ^ 2 5c <77 22 —j <77 221 

Theorem 22. For c’acA k > 1 : If a formula E 7s provable (deducible 
from formulas T) t'« the pure or k-predicate calculus, then all of the k- 
transforms of E are provable (deducible from the k-transforms of the formidas 
T) m the propositional calculus. (Hilbert-Bernays 1934, pp. 119 ff.) 

Proof is left to the reader. 

We easily see that a predicate letter formula is ^-identical, if and only 
if all of its ^-transforms are identically true (§ 28). Thus, using Theorem 
9, Theorem 20 becomes a corollary of Theorem 22. 

Example 1 (concluded). The formula 
(<77 n V <77 21 ) 5c |77j 2 V 7T 22 ) ^ (<77^ 5c <77 j 2 ) V (77 2 j 5c o77 22 ) is not provable 
in the propositional calculus, since it takes the value f when <77 u , <7? 12 , 
<77 2 i, <77 22 take the respective values t, f, f, t. Hence V£3<z<77(cz, b) D 
3 a'ib^l(a, b) is unprovable in the predicate calculus. Comparing these 
values for <77 u , <77 12 , <77 21 , 7? 22 with the table for I (a, b) in Example 2 
§ 36, we see that this is basically the same refutation as there. 

Example 3. Is -i-i3a<77(<j) D 3<3 -i-i<77(a) (which is of the form 
IICj D lib of Corollary Theorem 17) provable in the intuitionistic pred¬ 
icate calculus? By Theorem 22 (with k — 2), it is only if -i-i(<77 1 V <77 2 ) 
D —1 “i <7?! V —1 “i <77 2 is provable in the intuitionistic propositional 
calculus (a result to be used in § 80). 

It is often useful heuristically in studying the predicate calculus to 
think of VxA(x) as a conjunction extended over all the members of the 
object domain, and of 3xA(x) similarly as a disjunction, even though 
only in the case of a finite domain with a given number k of members 
are we able to construct synonymous formal expressions on this basis. 
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From this analogy between V and & and between 3 and V, and the notion 
of & as analogous to • and V to -j- (end § 29), some authors write VxA(x) 
as “n x A(x)” and 3xA(x) as “2 x A(x)”. (Compare also the definitions of the 
product and sum of a set of sets, § 5.) 

In retrospect, we can see how the postulates 3 — 6 for & and V suggest 
those for V and 3. The actual postulates of Group A2 are what the analogy 
calls for, allowing for certain differences in detail. The analogy is clear 
in the corresponding derived rules of Theorem 2 . Of the numbered 
results, e.g., *75, *76, *83, *84, *91, *92 are analogous to *37, *38, *56, 
*57, *35, *36, respectively. 

Predicate calculus with a postulated substitution rule. As in 
the case of the propositional calculus (§30), the predicate calculus is 
usually formulated with a postulated instead of a derived substitution 
rule, namely, in the notation of Theorem 15 § 34 and under Proviso (A): 

E 

E*. 

The predicate letters are then called predicate variables', the rule is usually 
construed to apply to a single variable at a time; and as before these 
variables must be held constant in subsidiary deductions for assumptions 
formulas to be discharged. (The first accurate statement of the substi¬ 
tution rule seems to be that of Hilbert-Bernays 1939 pp. 377—378, or 
1934 p. 98 understanding the restriction on “Einsetzung” to apply also 
to “Umbenennung”.) 

ti-VALUED predicate calculi. Cf. Rosser and Turquette 194 S- 51 , 
1952 . 

Several-sorted predicate calculi. As was implied in §31, the 
predicate calculus may be studied with several object domains, some of 
the variables being specified as ranging over one of these domains, 
others over another, etc. If these several object domains are regarded as 
simply several different primary categories of objects, the only new feature 
involved is that no operation should be performed which substitutes a 
term or variable of one sort, i.e. referring to one of the domains, for one 
of another. For example, *79 would hold only when x and y are variables 
of the same sort. Each of the attached variables for a name form should 
be of a specified sort. (Cf. Herbrand 1930 , Schmidt 193 S, Wang 1952 and 
Example 13 §74.) 

Higher predicate calculi. However, one may also obtain predicate 
calculi with several types of variables by starting with predicate calculus 
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first with a single primary domain of objects called individuals', then 
with an additional domain of objects consisting of the predicates over the 
first domain of objects, thus admitting quantifiers VP and 3P where 
P(a 1( ..., a n ) is a predicate variable of the first system; and so on. When 
a hierarchy of predicate calculi built on this plan is being considered, the 
first is called the restricted predicate calculus or predicate calculus of first 
order, and the others predicate calculi of second order, third order, etc., 
or generally higher predicate calculi. Many difficult questions arise in 
considering hierarchies of systems of this sort, which are investigated by 
the logicistic school (§ 12). A brief introduction is provided by Chapter 
IV of the 2 nd (1938) or 3rd (1949) edition of Hilbert-Ackermann 1928 . 
Predicate calculi of second order are treated in Church 1956 Chapter V, 
and Church planned to treat predicate calculi of higher order in Chapter 
VI of his projected vol. II. 
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§ 38. Induction, equality, replacement. In this chapter we return 
to the study of the full formal system of Chapter IV. 

We shall now state our results mainly with particular formal variables, 
as the postulates of Group B (after 13) were stated, so that the provable 
formulas will read as particular theorems of number theory formalized 
in the symbolism of the system. Results of the form A stated with 
particular free variables can be applied with terms substituted for those 
variables, in view of the substitution rule for individual variables (§ 23 
and *66 §32). 

From Postulate 13 (using D-introd., V-introd., &-introd. and D-elim.), 
we have the following formal rule of mathematical induction. Formal in¬ 
ductions by this rule are of course altogether separate from informal 
mathematical inductions used in proving metamathematical theorems. 

Induction rule. Let x be a variable, A(x) be a formula, and Y be a 
list of formulas not containing x free. If Y \- A(0), and T, A(x) \- A(x') 
with the free variables held constant for A(x), then Y \- A(x), 

Beginning with the proofs of the next theorem, we shall frequently 
adopt a more informal presentation of demonstrations of formal prova¬ 
bility or deducibility. The use above of the chain method in handling 
equivalences (§ 26) was a step in this direction. We now go further, omit¬ 
ting the symbol “ j-” in many situations. Thus we shall say “assume A”, 
meaning that we wish to take A as an assumption formula in constructing 
a deduction. Of course, properly we are not assuming anything, but 
indicating that the formulas to follow are to be formally deducible from A 
(and any other assumption formulas which we have introduced), until 
the discharge of the assumption formula A is indicated or implied by 
the context. At each stage in such informal presentation, the formulas 
given are to be understood as deducible from all the assumption formulas 
not yet discharged. 

Example 1. The following is a demonstration in this presentation 
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that P 3xA(x) D -i Vx -iA(x) (cf. the first half of the proof of *83 § 35). 

In preparation for D-introd., assume 3xA(x). Preparatory to 3-elim. 
from this, assume A(x). We shall deduce -iVx-iA(x) by reductio ad 
absurdum (i.e. -i-introd.). Assume for this purpose Vx-iA(x). Then by 
V-elim. -iA(x), contradicting A(x). [Then -i Vx-iA(x) by the -i-introd., 
which discharges the assumption Vx -iA(x). Since -i Vx -i A(x) does not 
contain x free, the 3-elim. can be completed now, discharging A(x). 
Finally we have 3xA(x) 3 -iVx-iA(x) by the D-introd., discharging 
3xA(x).] The bracketed steps will often be tacit. 

To analyze this, we list the formulas, showing by an arrow how long 
each assumption remains in force. 

1 . 3xA(x) — assumed. 

2. A(x) — assumed. 

| 3. Vx-iA(x) — assumed, 

j 4. -iA(x) — V-elim., 3. 

- 5. -iVx-iA(x) — -i-introd., 2, 4. 

. 6. -iVx-iA(x) — 3-elim., 5. 

7. 3xA(x) D-i Vx-i A(x) — D-introd., 6. 

Each formula is deducible from the assumption formulas whose arrows 
appear opposite it, e.g. Line 5 means that -iVx-iA(x) is deducible 
from 3xA(x) (Line 1) and A(x) (Line 2). Stating these facts in the 
P-notation, our demonstration appears in the former style. 

1. 3xA(x) p 3xA(x). 

2. A(x), 3xA(x) p A(x). 

3. Vx-iA(x), A(x), 3xA(x) p Vx-iA(x). 

4. Vx-iA(x), A(x), 3xA(x) p -iA(x) — V-elim., 3. 

5. A(x), 3xA(x) p -iVx-iA(x) — -i-introd., 2, 4. 

6 . 3xA(x) p -iVx-iA(x) — 3-elim., 5. 

7. p 3xA(x) D-iVx-iA(x) — D-introd., 6. 

Note that the columns of assumption formulas take the place of the 
arrows. For Step 5, we can supply Vx -i A(x) as an additional assumption 
formula in 2 by general properties of p. At Step 6, 3-elim. gives 3xA(x), 
3xA(x) p -i Vx -i A(x), and the extra 3xA(x) is omitted by general prop¬ 
erties of p. Actually it is immaterial whether 3xA(x) is considered as 
an assumption formula for 2—5 or not. 

Applications of the rule of V-elimination will now often be described 
in the language of “cases” (cf. § 23). Applications of the formal induction 
rule will often be presented in the following manner, using the same ter- 
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minology as in informal inductions (§ 7): Basis. ... A(0). Induction step. 
Assume A(x) (the hypothesis of the induction). Then ... A(x'). [At this 
point the induction step is completed, and A(x) ceases to be an assumption 
formula.] Hence A(x). [Other formulas T may have been in use as as¬ 
sumption formulas throughout. This proof that b A(x), or that T |- A(x), 
we say is "by (formal) induction on x”; A(x) is the "induction formula”.] 

This informal presentation is convenient in situations where the formal 
development closely parallels intuitive reasonings. It saves space, and 
brings our procedures for demonstrating the facts about formal provability 
and deducibility still closer in appearance to the methods of informal 
mathematics (cf. § 20). 

The reader should then understand at all times how the procedures 
can be made rigorous as applications of our derived rules stated in terms 
of the symbol The b -notation has been used to give concise and 
accurate statements of our derived rules, showing clearly their structure. 
We continue to use it when we have new rules to state, and in other 
passages where it helps to emphasize the form of the deducibility relation¬ 
ships or the fact that we are talking about the formulas of the system 
(and not in them). 

We shall hereafter commonly abbreviate a-b to “ab". 

Theorem 23. (Properties of equality.) 

*100. b a=a. *101. b a=b 3 b=a. 

*102. 1 - a=b & b—c D a=c. 

(Reflexive, symmetric and transitive properties.) 

*103 (Axiom 17). b a=b "D a'=b'. 

*104. b a—b 3 a+c=b-\-c. *105. b a=b 3 c-\-a=c+b. 

*106. b a=b 3 ac=bc. *107. b a=b 3 ca=cb. 

(Special replacement properties of the function symbols +, •.) 

*108 (Axiom 16). b a=b 3 (a=c 3 b—c ). 

*109. b a=b 3 (c=a 3 c=b). 

(Special replacement properties of the predicate symbol = .) 

Proofs. *100. We exhibited a proof of this formula directly from the 
postulates as Example 1 § 19. Taking advantage of the derived rules 
established meanwhile, we can now summarize it thus. From Axiom 16 
by substitution (*66 § 32), a-\-Q—a 3 (<x-f0=d 'Da—a). Thence by Axiom 
18 (using 3-elim. twice), a=a. 

*101. Using the informal presentation, assume a=b. By Axiom 16, 
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a=c 3 b=c. Substituting a for c, a—a 3 b=a. Thence by *100, b—a. 

*104. Proof will be by the formal induction rule. In both the basis 
and the induction step, the method of a chain of equalities will be used 
(cf. end of § 26). Our use of this method depends on having already 
established *100—*102. Since we do not yet have the the general replace¬ 
ment property (Theorem 24 (a)), any step which entails replacing a term 
by another term given as equal to it will have to be justified by a special 
replacement result fitting the situation. Here we shall use Axiom 17 
(*103) for the purpose. Substitution in an axiom or previously established 
formula will not be explicitly mentioned. Assume a=b. We deduce 
a+c=b+c by induction on c, thus. Basis. <7+0 = a [Axiom 18] = b 
[assumption] = b+0 [Axiom 18 ]. Induction step. Assume, as hy¬ 
pothesis of the induction, a+c—b+c. Then a+c' = ( a+c )' [Axiom 19] 
= ( b+c )' [hypothesis of the induction, Axiom 17] = b+c' [Axiom 19]. 

*105. Let a=b 3 c+a=c+b be abbreviated “A(a, b)”. We shall prove 
V/>A(<z, b) by induction on a. We do the induction step, leaving the 
basis to be done similarly by the reader. Ind. step. Assume VbA(a, b). 
By V-elim., A(<z, b). We now deduce A(<*', b) by induction on b. Basis. 
Assume <7'=0. But by Axiom 15, -i <7'=0. Hence by weak -n-elim. 
(§ 23), c+a'=c+ 0. Ind. step. (We do not need to use the hypothesis 
of the induction on b.) Assume a'—b'. Then by Ax. 14, a—b\ and using 
A(a,b), c+a—c+b. Now c+< 7 '= (c+<z)' [Ax. 19] = (c+b)' [using 
c+a—c+b with Ax. 17] = c-\-b' [Ax. 19]. (Why did we take 'ibA(a, b) 
instead of A(<z, b) as the induction formula for the induction on <z? Cf. 
the statement of the induction rule. For the induction on b within the 
induction step of the induction on a, 'ibA(a, b) is the I\) 

Another way to handle *105 is to prove *118 and *119 first, after 
which *105 can be inferred from *104. 

*106 and *107. Similarly to *104 and *105. To justify replacing ac 
by be in <7c+<7 after assuming ac—bc, we must use *104; etc. 

Replacement. Theorem 24. (a) Ifu T is a term containing a specified 

occurrence of a term r, and u s is the result of replacing this occurrence by a 
term , s, then 

r=s 1- u r —u s . 

(b) If C r is a formula containing a specified occurrence of a term r (not 
as the variable of a quantifier), and C s is the result of replacing this occur¬ 
rence by a term s, then 

r=s h* 1- "*” C r ~ C s 
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where x lf ..., x n are the variables of r or s which belong to a quantifier of 
C r having the specified occurrence of r within its scope. (Replacement 
theorem.) 

Example to follow. Proof is by the same method as before (§§ 26, 33), 
using seven additional lemmas. 

Additional lemmas for replacement. If r and s are terms: 

*110. r=s |- r'=s\ 

♦111. r=s |- r+t=s+t. *112. r=s \- t+r=t+s. 

*113. r=s h rt=st. *114. r=s \- tr=ts. 

*115. r=s |- r=t'—' s=t. *116. r=s |- t=r ~ t=s. 

Proofs. The first five of these lemmas follow from *103—*107, 
respectively, by substitution (*66) and D-elim. The last two follow 
from *108 and *109 with *66, *101, D-elim. and *16. 

Example 2. Let r be b, s be a, C r be 3 d(d'+b=c). The parallel 
constructions of C r from r and of C B from s are as follows, and the depth 
is 3. 

b a 

d'+b d'+a 

d'-\-b=c d'+a—c 

3 d{d'+b=c) 3 d{d'+a=c) 

Let = be written between the expressions in each of the top two lines, 
and ~ in each of the bottom two. The resulting formulas are deducible, 
each from the preceding, using successively *112, *115, *72 (varying d). 
Using the abbreviation “<” (§ 17), the result can be written b—a 
b<c ~a<c. (Why is d not varied in the result?) 

Corollary 1. Under the conditions of the theorem: If |- r=s, then 
1- u r =u s and |- Cr ~ C g . 

Corollary 2. Under the conditions of the theorem: r=s, C r h Xl ‘“ x " C 8 
with x t , ..., x„ varied only for the first assumption formula. If \- r—s, 
then C r |- C s . (Replacement property of equality.) 

Example 2 (concluded). Using also *101: a—b, b<c \ a<c. 

As before, a replacement may be preceded by a substitution for in¬ 
dividual variables (cf. § 33 Example 4, and § 34 just above Remark 2). 

§ 39. Addition, multiplication, order. Postulate Group B (§19) 
may be described as follows. Postulates 14, 15 and 13 express formally 
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the last three of Peano’s axioms (§ 6). (The first two enter in the present 
system, which has only natural number variables, through Clauses 1 and 5 
in the definition of term, § 17.) Axioms 16 and 17 give properties of 
equality (including as 17 the univalence of the successor function ', 
which was implicit in Peano’s formulation). Axioms 18 and 19 are what 
may be called 'recursion equations’ defining the function +, and Axioms 
20 and 21 are the like for the function •. 

For our abbreviations "a^b” and "1”, “2”, “3”, .see § 17. 


Theorem 25. (Arithmetic laws.) 

*117. b ( a+b)+c=a+(b+c ). *121. b (ab)c=a(bc). 

*118. b a'+b=(a+b)'. *122. b a'b=ab+b. 

*119. b a+b=b+a. *123. b ab=ba. 

*120. b a(b-{-c)=ab-\-ac. 

(Associative, commutative and distributive laws for + and *, with lemmas 
used in proving the commutative laws.) 

*124 (Axiom 18). b a-{-0=a. *125 (Axiom 20). b <r0=0. 

*126. b a+l=a'. *127. b a-l=a. 

(Direct laws for 0 and 1.) 

*128. \- a+b=0 z> a=0 & b=0. *129. b ab—0 D a=0 V b—0. 

*130. b a+b=\ => a=l V b=l. *131. b ab=\ Z> a=\ &b=\. 

(Inverse laws for 0 and 1.) 

*132. b a+c=b+c 3 a=b. *133. b c#0 3 (ac=bc 3 a=b). 

(Inverse laws for + and •.) 


Proofs. In these proofs, since we now have Theorem 24 (a), we do 
not need to invoke the special cases of it *103—*107, as we did in the 
proofs for Theorem 23. 

*117 and *118. By induction on c and b, respectively. 

* 119. By induction on a, using induction on b in the basis. 

*122. By induction on b. Ind. step. Assume a'b=ab+b. Then 
a'b' — a'b+a' [Ax. 21] = ( ab+b)+a ' [hyp. ind.] = (( ab+b)+a )' [Ax. 
19] = (ab+(b+a))' [*117] = ( ab+{a+b ))' [*119] = (( ab+a)+b)' [*117] 
= (db'+by [Ax. 21] = ab'+b' [Ax. 19]. 

*128. If b A(0) and b A (x'),then b A(x). For from b A(x') by general 
properties of b, A(x) b A(x'), and so induction on x applies. This rule 
we call induction cases (on x). For *128, call the formula to be proved 
“A(a, b)”. To prove A (a, b) by induction cases on a, it will suffice to prove 
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A(0, b) and A (+, b). To prove these two, by induction cases on b, it will 
suffice to prove the four formulas A(0, 0), A(0, b'), A (+, 0), A (a', b'), i.e. 
the formulas 

0+0=0 D 0=0 & 0=0, 0+b'=0 D 0=0&Z/=0, 

++0=0 Z> +=0 & 0=0, ++/>'=0 D +=0 & b'= 0. 

The proofs of these four are easy, as in each case we can either refute 
the premise of the implication (using Axiom 15) or prove the conclusion 
(cf. *10a and *11 § 26). 

* 130. This is established similarly, but with an iteration of the argument 
by cases, so that to prove A(x'), we prove A(O') and A(x"). Altogether, to 
prove A (a, b), it thus suffices to prove the nine formulas 

A(0, 0), A(0, 1), A(0,*"), A(l, 0), A(l, 1), 

A(l, b"), A(a", 0), A(a", 1), A (a",b"). 

The treatment of each of the nine is routine (using Axioms 14 and 15). 
*132. Induction on c, using Axiom 14. 

*133. Let “A(a,b)” abbreviate ac=bc Z3 a=b. Assume c+0. We 
deduce VaA(a, b) by induction on b, as follows. (Cf. *95.) Basis. Assume 
ac=0c. By *125 (and *123), ac= 0. But c+= 0. Hence by *129 and 
propositional calculus, a= 0. Ind. step. Assume VaA(a, b). By V-elim., 
A (a,b). We shall deduce A (a, b') by induction on a. Basis. Assume 
0 c=b'c. By *125 (and *123, *101), b'c= 0; and by *129, +=0Vc=0. 
But b' +0 by Ax. 15, and c^0 by hypothesis; hence i(i'=0Vc=0). 
From this contradiction, by weak -i-elim. (§23), 0 =b'. Ind. step. 
Assume a'c=b'c. By *122, ac+c=bc+c. By 132*, ac=bc. Thence by 
A(a,b), a=b; and by *103, a'=b’. 

Another way to handle *133 is to wait until *139 is established, 
after which it can be proved similarly to *146b. 

We now use the abbreviation “a<b” for 3c(c'+a=b) under the 
conventions discussed in §§ 17, 33; and we read “a>b” as abbreviation 
for b<a; “a^b” for a<bV a—b; "a^b” for b<,a\ ”a<b<c" for 
a<b&b<c (cf. end of §26); etc. 

Theorem 26. (Order properties.) 

*134a. )- a<b<c D a<c. *134b. h a<,b<c 3 a<c. 

*134c. 1 - a<b<,c'D a<c. *134d. I- a<,b<,c D a<.c. 

(Transitive laws.) 

* 135a. Ya<a'. *135b. H 0<+. *136. \- 0^a. 

*137 (= *137 0 ). \-a=0V3b{a=b'). *137h a=0Va= 1 V Zb{a=b"). 
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b a =0 Wa= IV a =2 V 3 b{a=b'"). 
b a<b ~ a<b'. *138b. |- <j>£ >—' a>b'. 

(Order properties for 0 and '.) 

[- a<bV a=b V a>b. 

b -\a<a. *141. |- a<b 3 -ia>b. 

(Connexity, irreflexiveness, asymmetry.) 

b *+b * 143a. |- b^ 0 3 ab>.u. 

b b=£0 3 a+b>a. *143b. b a^O & b> 1 3 ab>a. 

b b^O 3 a'b>a; hence b b^O 3 3 c(cb>a). 
b a<b ~ a-\-c<b+c. *145a. b c=£ 0 3 (<z<£ ~ ac<bc). 

b a<b ~ a+c<b-{-c. *145b. b c^O 3 (a<b ~ ac<bc). 

(Inequalities under addition and multiplication.) 

b b^O 3 3(j3r(a=bq-\-r & r<b). 

b a=bcj 1 +r 1 &r!<b & a=bcj 2 +r 2 & r 2 <b 3 ^i=< 7 2 & 0 =r 2- 
(Existence and uniqueness of quotient and remainder.) 

Proofs. *134a. Assume a<b<c, i.e. a<b & b<c, i.e. 

3 d(d'+a—b) & 3 e{e’+b—c). Preparatory to 3 - and &-elim., assume 
d'+a=b and e'+b=c. Then e'+{d'+a)=c, which can be reassociated 
as (e'+dy+a=c. By 3-introd., 3f(f'+a—c), i.e. a<c. 

*134b. From *134a, with the help of proof by cases (§ 23). (Cf. 
Example 2 § 38.) 

*136. By induction cases on a, using *135b. 

*137, *137*. By repeated use of induction cases; or thus: The 
formula of * 136 is equivalent to that of *137, using Ax. 18 and properties 
of = (cf. Example 4 § 33); and *137!, *137 2 , ... follow successively. 

*138a. Using 3- and V-elim. and introd. with *137, 
b a<b’ ~ 0'+a=b' W3c(c"+a=b'). Using *119 and Axs. 19, 17, 14 
and 18, b 0'+a=b' ~ a=b. Using *119 and Axs. 19, 17 and 14 with 
*72 (or 3-elim. and introd.), b 3 c(c”+a=b') ~ a <b. 

*139. By induction on b, using *136 in the basis, and *138a,b in the 
induction step. 

*140. Assume a<a, i.e. 3b(b'+a=a). For 3-elim., assume b'+a=a. 
Then by *132 with *124 and *119, b' = 0, contradicting Ax. 15. 
Remark. Because the contradictory formulas b'—O and b'^0 
contain b free, we cannot carry out the 3-elim. immediately. But by 


*137 2 . 

*138a. 

*139. 

*140. 

* 142a. 

* 142b. 
*143c. 

* 144a. 

* 144b. 

*146a. 

* 146b. 
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weak -i-elim. (§ 23), we can first deduce a pair of contradictory formulas 
not containing b free, e.g. 0=0 and 0#0. 

By *140, b a=b 3 -i a<b. 

*143b. Assume a^0&b> 1. By a^O with *137, 3c(<z=c');andby b>\ 
with * 140, * 141 and * 135a, b ^0 & b ^ 1, and thence with * 137 x , 3 d{b=d"). 
Assume (preparatory to 3-eliminations) a—c' and b— d". 

* 144a. From * 104, * 132, * 117 and *72. 

*145a. Assume c^0. Part 1: to deduce a<b 3 ac<bc. (Left to the 
reader.) Part 2: to deduce ac<bc 3 a <b. Assume ac<bc. To deduce 
a<b, it will suffice by cases (V-elim.) from *139 to deduce a <b under each 
of three case hypotheses. Case 1: a<b. Case 2: a=b. Then ac=bc. 
This with *140 gives -i ac<bc , contradicting our assumption ac<bc. 
By weak -i-elim., a<b. Case 3: a>b, i.e. b<a. Then by the result of 
Part 1, bc<ac, i.e. ac>bc. Thence by *141, -i ac<bc. 

*146a. Use induction on a (after assuming b^O). (Proof can also 
be based on * 143c and * 149.) 

*146b. Assume a=bq 1 +r 1 & r^<b & a=bq 2 +r 2 & r 2 <b. To deduce 
q x =q 2 , it will suffice by *139, using cases and weak -i-elim., to deduce 
a contradiction from q x <q 2 and again from qi>q 2 . Assume q x <q 2 . For 
3-elim. from this, assume e'-\rqi=q 2 - Now bq^rx = a — bq 2 +r 2 = 
b[e'+qi)+r 2 = bq^ibe’+rj. So r 2 = be'+r 2 [*132] ^ be' [*142a] 
;> b [*143a]. This contradicts <b, by *140 and *141. The other 
case is similar. Having deduced qi=q 2 > we have a = bq 1 -\-r 1 = bq 1 +r 2 , 
whence r x =r 2 by *132. 

*§ 40. The further development of number theory. Our formal 
system of number theory differs from the informal theory in that the 
logic is made explicit. We have brought our acquaintance with the logic 
to a stage where the further development of number theory in the formal 
system will proceed much along lines already familiar to us from the 
informal theory. We shall not continue systematically with this develop¬ 
ment, but will only note several aspects of it, before turning to general 
metamathematical questions about the system, 

For this section, let x be a variable, A(x) be a formula, and y and z be 
variables distinct from x and each other which are free for x in A(x) and do 
not occur free in A(x). 

The least number principle (or well-orderedness of the natural numbers) 
says that, if there exists a natural number x such that A(x), then there 
exists a least such x, call it y. The property of y can be expressed in the 
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formal symbolism by A(y)&Vz(z<y 3 -iA(z)), or in an equivalent 
form using: 

*147. b z<y 3 -iA(z) /-w A(z) 3 y <z. 

(By *13, *139.) We establish first: 

*148°. b 3y[y<x & A(y) & Vz(z<y 3 -iA(z))] V Vy[y<x 3 -iA(y)]. 
(Law of the excluded middle, and least number principle, 
for an initial segment of the natural numbers.) 

*148a. A(x)V-iA(x) b x 3y[y <x & A(y) & Vz(z<y 3 -> A(z))] V 
Vy[y<x 3 ~iA(y)]. 

*148b. b ~i“i{3y[y<x & A(y) & Vz(z<y 3-iA(z))] V 
Vy[y<x 3 ~iA(y)]}. 

(Intuitionistic versions of the same.) 

Proofs. *148. By induction on x, thus. Let the formula of *148 be 
abbreviated “P(x)VQ(x)”. Basis. From *136, *140, *141 and *10a, 
b Q(0), whence b P(0) V Q(0). Ind. step. Assume P(x) V Q(x). We deduce 
P(x') V Q(x') thence by cases (V-elim.). For Case 1, P(x) b P(x') b P(x') 
V Q(x'), using *135a, *134a. For Case 2, we use subcases from A(x) V 
-iA(x) (*51). For Subcase 2a, Q(x), A(x) b P(x') b P(x') V Q(x'), using 
*135a. For Subcase 2b, Q(x), ~iA(x) b Q(x') b P(x') VQ(x'), using * 138a. 

*148a. Since *51 is not available in the intuitionistic system, we now 
use Vx(A(x) V-iA(x)) as an assumption formula T for the induction. 

*148b. From Axiom Schema 6, by 3-elim. and contraposition twice 
(*13, *12 ), A3—i-i C, B 3 —i —iC b AVB D -nC b t'(AVB) 
3 -l -i C. Thence by the 3-rules : If T, A b “i i C and T, B b “»”» C 
(with the free variables held constant for A and B, respectively), then T, 
-i -i (A V B) b ”i C. Thus we justify a modification of proof by cases, 
in which the case formula AVB and the conclusion C are doubly negated. 
Hence, if we replace the induction formula in the proof of *148 by 
-i-i(P(x) V Q(x)), and the other case formula by -i-i(A(x) V-iA(x)) 
(which is provable intuitionistically by *5la), the induction again works 
(using *49a), and gives us *148b. 

Now we can infer the least number principle. 

*149°. b 3xA(x) 3 3y[A(y) & Vz(z<y 3 -iA(z))]. 

(Least number principle.) 

*149a. A(x)V-iA(x) b x 3xA(x) 3 3y[A(y) & Vz(z<y 3 -iA(z))]. 

*149b. b ~i-i{3xA(x) 3 3y[A(y) & Vz(z< y 3 iA(z))]}. 

(Intuitionistic versions of the least number principle.) 
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Proofs. *149 (or *149a). Assume 3xA(x); and for 3-elim., A(x). 
Substitute x' for x in *148 (or in the conclusion of *148a) to obtain 
P(x') V Q(x'). Case 1: P(x'). Thence 3y[A(y) & Vz(z<y 3 -iA(z))]. 
Case 2: Q(x'). Thence with *135a, -iA(x), contradicting A(x). By weak 
-i-elim., 3y[A(y) &Vz(z<y 3 -iA(z))]. 

*149b. Using instead *148b and the modification of proof by cases, 
3xA(x) 3 -i-i3y[A(y) & Vz(z<y 3 -iA(z))]. Now use *60h, g. 

Other consequences of * 148a are: 

*150. A(x)V-iA(x) (- x 3y[y<x & A(y)] V-i3y[y<x & A(y)]. 

*151. A(x)V-iA(x) b x Vy[y<x 3 A(y)] V-iVy[y<x 3 A(y)]. 

(Of interest for the intuitionistic system.) 

Proofs. *150. From *148a, A(x)V-iA(x) b x 3y[y<x&A(y)] V 
Vy[y<x 3 -iA(y)]. But 1- Vy[y<x 3 -iA(y)] ~ Vy-i[y<x & A(y)| 
[*58b] ~ -i3y[y<x & A(y)] [*86]. 

*151. Similarly, applying *148a to -i A(x), and replacing -i-iA(x) 
by A(x) (since by *49c, A(x) V -iA(x) |- -i-iA(x) ~ A(x)). 

As an example of a number-theoretic theorem requiring some further 
concepts, we shall treat Euclid’s theorem that there exist infinitely many 
prime numbers. This may be expressed by saying that to any number a, 
there is a prime greater than a. In fact, there must be a prime between 
a +1 and a! -f 1, inclusive, by the following reasoning. Every positive 
integer n <, a divides a\. Hence none of them except 1 divides «!+l. 
But a!+1 > 1; so it is, or has as factor, a prime. This prime is between 
a -)-1 and a ! +1, inclusive. 

The two cases may be combined by noting that the least divisor of 
a ! +1 greater than 1 is a prime greater than a. Also the reasoning holds 
good using in place of a ! any common multiple of 1, .... a. 

Preparatory to the formal treatment of Euclid’s theorem, we now 
introduce “a\b” (read “a divides b” or “a is a factor (divisor) of b”) 
as abbreviation for 3 c(ac—b). We can show: 

*152. b a\ab. *153. \-a\a. *154. b a\b & b\c 3 a\c. 

*155. I- a>l 3 -\{a\b & a\b'). *156. b bj--0 3 (a\b 3 0<a<b). 

(Properties of |.) 

(Hints: For *155, use *137 1( *145a (with *135a), *132, Ax. 15. For *156, 
use *143a.) Next we introduce “Pr(a)” (“a is prime”) as abbreviation for 
a> 1 & —i3c( 1 <c<a & c\a). Then Euclid’s theorem is expressed in the 
formal system by the formula 3/>(Pr(/>) & b>a). 
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A difficulty for formalizing the foregoing proof is that the system 
has no term to express the function a!. We avoid this by establishing: 

*157. H 3d[d>0&Vb(0<b^a D b\d)]. 

(Existence of common multiples of 1, 

(Proof by induction on a. Call the formula “3dA(a, d)”. For the basis, 
h A(0, 1). For the ind. step (preparatory to 3-elim.), A (a, d) )- 
A (a', da').) Now assume in preparation for 3-elim., 

(1) d>0 & Vb(0<b<a 3 b\d). 

The variable d governed by this formula then has the role of a !; more 
precisely, it may represent under the interpretation any common multiple 
of 1 

By (1), *144a and *153, d’> 1 &d'\d'. Hence 3e(e>\ &e\d'). By the 
least number principle (*149), 

3b[b>\&b\d'&Vc(c<b D -i (c>l&c|*/'))]. Assume, for 3-elim., 

(2) b> 1 & b\d' & Vc(c<b D-i(c>l & c\d')). 

Assume \<c<b & c\b. From c<b by (2), -i(c> 1 &c|</'); but from 
l<c, c\b, b\d' (from (2)) and *154, c>l &c\d'. By -i- and V-introd., 
Vc -i (\<c<b & c\b) ; whence by *86, -n3c(l<c<b & c\b). Using also 
b> 1 from (2), Pr(/>). 

By (2), b> 1 &b\d’. Hence by *155, -\b\d. Hence by (1), b>a. 

By &- and 3-introd., 3/>(Pr (b) & b>a). Since this formula does not 
contain b or d free, by 3-elim. the assumptions (2) and (1) are discharged. 
This completes the proof of Euclid’s theorem in the classical system. 

To prove it in the intuitionistic system, using *149a instead of *149, 
it remains to establish (e>l & e\d’) V —i (e> 1 & e\d'). For this purpose, 
we first establish: 

*158. H a—b V -ta—b. *159. h a<b V -\a<b. 

*160. h a\b ~ 3c(c<b & ac=b). 

(For use in the intuitionistic treatment of Euclid’s theorem.) 
(Prove *158 and *159 by *139—*141; *160 similarly to *156.) Then we 
prove successively e>\ V -ne>\ (by *159), e\d' V -te\d' (by *160, *138a, 
*150, *158), and (e>l & e\d') V -i (e>\ & e\d') (thence by Remark 

1 (b) § 29 and Theorem 3 § 25). Thus, intuitionistically as well as classi¬ 
cally-: 

*161. 3b'(Pr(b) & b>a). (Euclid’s theorem.) 

The process of recognizing that the proofs in an informal theory can 
be formalized in a given formal system is one of continual analysis and 
stereotyping of arguments which recur in the informal theory, to keep 
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pace as the theory develops with the increasing condensation of the in¬ 
formal reasoning. We attempt to recognize successively these types of 
informal arguments as being formalizable, and we may record the results 
we thus obtain as derived rules for the formal system. This is very similar 
to the semi-formal process of developing an informal theory itself from 
explicitly stated postulates; but here we have gone further back to 
postulate the logical as well as the (in the ordinary sense) mathematical 
principles. 

Informally, we have used mathematical induction not only in the simple 
(or ordinary) form, but also in a modification called ‘course-of-values 
induction’ (cf. § 7 including Example 2, the proof of Theorem 1 § 21, etc.). 
It is of interest now to recognize that in the formal system this mod¬ 
ification can be derived from the simple form of induction which is 
postulated for the system. We shall state it as a theorem schema; the 
method, of formulating a rule on the basis of the schema has been suf¬ 
ficiently illustrated on simple induction (cf. Axiom Schema 13 in § 19 
with the induction rule in § 38). 

In the first schema *162a, the expressions A(0) and 
Vx[Vy(y <x 3 A(y)) 3 A(x')] formalize the basis and induction step, 
respectively; in the more compact form * 162b, the two are brought 
together in the single expression Vx[Vy(y<x 3 A(y)) 3 A(x)]. 

*162a. b A(0) & Vx[Vy(y<x 3 A(y)) 3 A(x')] 3 A(x). 

*162b. (- Vx[Vy(y<x 3 A(y)) 3 A(x)] 3 A(x). 

(Course-of-values induction.) 

Proofs. *162a. Assume A(0) & Vx[Vy(y<;x 3 A(y)) 3 A(x')], deduce 
Vy(y^x 3 A(y)) by simple induction on x, and infer A(x) by V-elim. 

Sometimes inductions require a double basis; i.e. we establish as the 
basis 4(0) and 4(1), and then for the induction step infer A(x") from the 
two preceding cases A(x) and 4 (x'). This can be treated formally as a 
course-of-values induction, using cases according as x'=l or x’>l 
under the induction step; or we can make it into a simple induction by 
using A(x) & A(x') as the induction formula. This device of using a con¬ 
junction as the induction proposition applies similarly to induction from a 
k-fold basis for any fixed k>:2, and also to the inductive proof of several 
propositions simultaneously. 

Inductive arguments are sometimes presented in the guise of a de¬ 
scending induction or proof by the method of infinite descent. This consists 
in establishing that A(x) is false for every x, by showing that if 4 (x) is 
true for any x, there is a lesser number for which it is also true. 
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*163. b Vx[A(x) => 3y(y<x & A(y))] D -iA(x). 

(Method of infinite descent.) 

*163a. b Vx[A(x) D -r‘n3y(y<x & A(y))] D -iA(x). 

(Additional version of interest for the intuitionistic system.) 

Proofs. Assuming the premise of either *163 or *163a, -iA(x) 
follows by a course-of-values induction. 

Another method. Taking -iA(x) as the A(x) of *162b, the result is 
equivalent to * 163a by the following steps. 

b Vy(y<x "D -iA(y)) D ~iA(x) ~ -i[Vy-i(y<x & A(y)) & A(x)l [*58b 
twice] — A(x) D-iVy-i(y<x& A(y)) [*33, *58b] ~ 

A(x) D -i-i3y(y<x & A(y)) [*86], Then *163 follows from *163a. 

Proofs of * 163a and * 163 can also be given by reductio ad absurdum 
from *149 or from *149b; and conversely *149b can be proved by 
reductio ad absurdum from *163a (using *60h, g). (In each case, the 
formula of the one is deducible in the intuitionistic predicate calculus 
from that of the other.) 

These examples suggest that the forms of argumentation ordinarily 
encountered in informal elementary number theory will turn out to be 
formalizable in our formal system. The lack of such functions as a\ 
remains a cause for doubt (although we did get around it in proving 
Euclid's theorem). Attention will be given to this question concerning 
functions in §§41, 49, 59, 74, 82. 

In § 42 we shall take up the question of the completeness of the formal 
system. From the standpoint of the interpretation, this includes whether 
all the possible reasonings of elementary number theory (not merely 
the commonly encountered ones) are formalizable in the system, at least 
in so far as they contribute to the proof of propositions expressible in the 
system. We shall also consider a more specific, strictly metamathematical 
notion of completeness. 

§ 41. Formal calculation. A formula A is said to be {formally) 
refutable, if —iA is provable. 

A closed formula A (end § 32) is {formally) decidable, if A is either 
provable or refutable, i.e. if either b A or b ~*A. 

The formal number-theoretic system (or a system with formation 
rules of a like sort) is said to be {simply) complete, if every closed formula 
A is formally decidable; {simply) incomplete in the contrary case that 
there is a formally undecidable closed formula. 
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The restriction that A be closed is essential here, in order that the 
metamathematical notion of simple completeness should have the in¬ 
tended significance. Otherwise, under the generality interpretation of 
the free variables in both A and -i A, the second formula would not ex¬ 
press the negation of the proposition expressed by the first (§ 32). 

Example 1 . The formula 2j a (i.e. 3c(0''-c=<j))expresses: every number 
a is even; and -i 2 |a expresses: every number a is not even, i.e. every 
number a is odd. Neither proposition is true; and, we hope, neither 
formula is provable. But V<j2|a expresses: every number a is even; and 
-\'1a2\a expresses: not every number a is even. By the classical law of 
the excluded middle, one of the two propositions should be true. In fact 
the second is; and the formula —i s 4a2\a is provable. 

We did not apply the notion of simple completeness to the propo¬ 
sitional and predicate calculi, because the proposition and predicate 
letters had the role of free variables for the interpretation (§§ 28, 29, 36, 
37). Simple completeness is another example of a notion of completeness 
with a positive criterion (§ 29). 

The terms 0, O', 0", ..., which represent the particular natural 
numbers under the interpretation of the system, we call numerals, and we 
abbreviate them by the same symbols “0”, “1”, “2”, respectively, 
as we use for the natural numbers intuitively (as in §§ 17, 37). Moreover, 
whenever we have introduced an italic letter, such as “x”, to designate 
an intuitive natural number, then the corresponding bold italic letter' 
“x” shall designate the corresponding numeral 0 (x) , i.e. 0'-' with x 
accents (x ^ 0) (as in § 37). In this connection, we can also use “x—1” 
to designate the numeral with x— 1 accents (for x> 0); there is no 
ambiguity, since we have no formal But "x+1” designates 

0 (a:) +0'. 

Let P{x lt . ..,*„) be an intuitive number-theoretic predicate. We say 
that P{x v ... , x n ) is numeralwise expressible in the formal system, if there 
is a formula P(x x ,..., x n ) with no free variables other than the distinct 
variables x 1( ..., x B such that, for each particular M-tuple of natural 
numbers x v ..., x n , 

(i) if P(x v . .., x n ) is true, then |- P(Xi,. ..,x«), and 

(ii) if P(x v ...,*„) is false, then b -i P^,..., x„). 

In this case, the formula P^,..., x B ) numeralwise expresses the predicate 
P(x v ..x n ) (with the formal variables x 1( ..., x„ corresponding to the 
respective intuitive variables x v . .., x„). 

Our metamathematical use of this notion will be confined to cases 
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when there is a decision procedure for the predicate P(x v ..., x„) (§ 30), 
so that for each -w-tuple x v ..x n , 

(iii) P{x v ..x n ) is true, or P(x v .. x n ) is false. 

Using (iii) with (i) and (ii), 

(iv) h P(*i,...,*») or b -tF(x v .. .,x„); 

thus P(x 1 ,...,x„) is decidable for each x v ...,x„, or as we shall say 
P(x 1 ,..x n ) is numeralwise decidable. The formula -iP(x 1 ,..., x n ) then 
numeralwise expresses the predicate not-P^,.. x n ). 

The notion of numeralwise expressibility gives only one of the senses in 
which a formula P(x 1 ,...,x B ) may express a predicate P(x 1 ,..., x n ). 
It requires more of the deductive apparatus of the system than merely 
that P(x 1( ..., x n ) should express P(x v ..x n ) under the interpretation of 
the symbolism (with the name form interpretation for x v ..x n , § 31), 
which after all requires nothing deductively. It does not require that 
formulas expressing various general properties of the predicate should 
be formally provable. 

Example 2. The formulas 3 c(c'+a=b) and 3 c{a+c'—b) each ex¬ 
press a <b (with a, b corresponding to a, b) under the meanings of 
the formal symbols, as we could see back in § 17 before we knew anything 
about the deductive rules. 

The first formula 3 c{c'-\-a=b) (which is the one we picked for our 
permanent abbreviation “a<b”, §§ 17, 39) numeralwise expresses a<b 
in the formal number-theoretic system, and even in the system without 
the induction schema (or Axioms 20 and 21), as we now establish. 

For (i), we must show that, if a and b are any two natural numbers 
such that a < b, then 1- 3c(c'+a=6). For illustration, let a — 3, 
6 = 5. Now h 0"+0'" = (0"+0")' [Ax. 19] = (0”-b0')” [Ax. 19, 
Ax. 17] = (0”+0)'" [Ax. 19, Ax. 17 twice] = O'”" [Ax. 18, Ax. 17 
thrice]. Thus (tacitly using *102), |- 0''+0”'=0”'”. By 3-introd., 
b 3c(c'+0'”=0””'), i.e. b 3c(c'4-3=5). A similar series of steps will 
give us b 3c(c'-ba— b) for any a and b such that a < b. To prove 
this in general, we may first establish as a lemma by informal induction 
on k that for any term t, using Axs. 17—19 (and *102), b t-b0 !fe) = t (fe) . 

For (ii) we must show that, if a and b are any two natural numbers 
such'that a is not < b, then b 3c(c'+a=b). If not a < b, then 
a ^ b. For illustration, let a = 3, b — 2. By applications of Axs. 17—19, 
exactly as above except that c' replaces 0” (or taking c' instead of 0" as 
the t of the lemma), b c'+0”W"'. Thence c'-b0'”=0 , ' b c”"=0” 
b c"=0 [using Ax. 14 twice]. But by Ax. 15, |- i c''=0. By 
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reductio ad absurdum (-i-introd.), |- -i c'-fO'"—0"; whence by V-introd. 
and *86 §35, b -i3c(c'+0'"=0"), i.e. b “i3c(c'-f 3=2). Similarly for 
any other a and b such that a ^ b, f- -i3c(c' + a—6). 

Thus 3 c(c'-\-a=b) numeralwise expresses a<b in the formal system 
without Axiom Schema 13. But we cannot expect that in this system 
formulas expressing general properties of <, such as those of *134a— 
146b (with “a<b” as abbreviation for 3 c(c'+a=b)), will be provable, 
except in a few cases (e.g. * 135b). 

The other formula 3 c(a~\-c’=b) (which by *119 is equivalent to 
3c(c’ -\-a=b) in the full system with Axiom Schema 13) seemingly 
does not numeralwise express a<b in the system without Axiom Schema 
13. Of course it does in the full system (or even without Axiom Schema 13, 
provided *118 or *119 is supplied as an axiom). 

The numbered results of this section (beginning with *(164)) refer pri¬ 
marily to the full number-theoretic system (as throughout this chapter). 
But in fact for them we need no new applications of the formal induction 
rule (or Axiom Schema 13), provided certain particular formulas previous¬ 
ly proved by means of it are available. More precisely, we can get along 
here with the predicate calculus, the particular number-theoretic axioms 
14—21, the replacement property of equality which depends on having 
in addition only *104—*107 §38, and *137 (or *136) §39, except in a 
few cases which we will keep track of and list at the end of the section. 
(This subsystem of the full system was singled out by Raphael Robinson, 
1950 abstract*, in a connection to be discussed in §76.) 

The predicates 

*(164) a=b, *(165) a<b 

are numeralwise expressed by the respective formulas a—b and a<b, 
i.e. 3 c(c'+a=b). 

Proofs. *(165). By Example 2 ; or (in the full number-theoretic 
system) using *135a, *134a, *140, *141. 

If x is a variable, A(x) and B(x) are formulas, k is a natural number, 
y is a variable distinct from x and free for x in A(x) and not occurring free in 
A(x), and t is a term not containing x and free for x in A(x): 

*166. A(0), A(l), ..., A(k— 1) b Vx(x<ft D A(x)). 

* 166a. A(0), A(l), ..., A(k) b Vx(x<ft D A(x)). 

*167. Vx(x<ft D A(x)) b A(£) for i = 0, 1, ..., k— 1. 

*167a. Vx(x<ft D A(x)) b A(i) for i — 0, \, ..., k. 
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*168. A(t) b Vx[x>t D 3 y(y<x & A(y))]. 

*169. Vx[x<t A(x)], Vx[x>t I> B(x)] b Vx[A(x) VB(x)]. 

When k = 0, the list A( 0 ), A(l),..., A(fe— 1 ) of assumption formulas 
for *166 is empty, and there are no A(£)’s for *167. 

Proofs. *166. If we can show that A(0), A(l), . . ,,A(ft— 1), x<k 
b A(x), then * 166 will follow by 3- and V-introd. By * 137(or via its proof, 
from *137 or *136), b x=0Vx= 1 V... Vx=ft—1 Vx=k V3y(x=y (fe+1> ). 
Accordingly by V-elim. with weak -i-elim. and 3-elim., it will suffice to 
deduce either A(x) or a contradiction from A(0), A(l), ..., A(fe— 1 ), 
x<ft with each of x=0, x=l, ..., x—k— 1, x—k, x=y (fc+1) in turn. 
But (using now the informal presentation, beginning § 38) from each of 
x=0, x=l, ..., x=ft— 1 with the corresponding one of A(0), A(l), ..., 
A(ft— 1 ), we obtain A(x) by replacement (Corollary 2 Theorem 24 §38). 
From x=k with x<k by replacement, k<k, contradicting -iftcft which 
is provable by *(165). From x=y (fc+1) and x<fe by replacement, y (t+1) <fe, 
i.e. 3z(z'+y ( * +1) =ft). Assume (for 3-elim.) z'+y (fc+1) =ft. Thence by k+1 
applications of Ax. 19 (with some applications of Ax. 17), (z'+y) <fc+1) =fe; 
whence by k applications of Ax. 14, (z'-by)'=0, contradicting Ax. 15. (Cf. 
the remark in the proof of * 140 § 39.) 

*167. Since i < k, by *(165) b i<k. 

*169. Using cases from *139. Remark. We require *139 only with t 
substituted for b. When t is a numeral k, this formula a <k V a=k 

V a>k can be proved from *137 or *136 similarly to *166. (Use *(165) 
in the first k cases. For the £+ 2 -nd case, a = b {k+1) = (b') (k) = (b'+0) ik) 
[Axs. 18, 17] = b'-\-k [Axs. 19, 17].) 

Although thus each of the formulas a<0V a=0V a>0, a<\W a=\ 

V a> 1, a<2V a—2V a>2, ... is provable in the system lacking Axiom 
Schema 13 (and even Axs. 14, 15, 20 , 21 ) but having the formula of *137 or 
* 136 as additional axiom, we have no ground for believing that the formula 
a<bW a—bW a>b of *139 itself is provable in that system. 

The remainder of this section may be postponed, if the reader prefers, 
until just before § 49. 

Under the interpretation of the formal symbolism, a number-theoretic 
function (f(x v . .., x n ) is expressed by a term t(x 1; ..., x„). 

Example -3. The function (a + 1) 2 is expressed under the inter¬ 
pretation by the terms (a')-(a'), aa+(2a+ 1 ), etc. 

The only number-theoretic functions which can be thus directly ex¬ 
pressed are the polynomials. However, we shall find that it is possible 
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to paraphrase many propositions in which other number-theoretic func¬ 
tions occur, so that those propositions become expressible in the formal 
symbolism despite the lack of terms expressing the functions themselves. 

For let y(x v ..x n ) be a given number-theoretic function, and let 
P(x 1 ,..., x n , w) be the predicate 9 (x v ..., x„)=w, which we call the 
representing predicate of the function <p(x v ..., x n ). If the predicate 
P{x lt .. ., x n ,w) is expressed in the system by a formula P(x x ,..x n ,w), 
and C(x) is a predicate expressed by C(x), then C(<p(x v ..., x n )) is expressed 
by 3w(P(x 1 ,.. .,x„,w) & C(w)) (and also by Vw(P(x 1 ,. . .,x„,w) ID C(w))). 

This suggests that, if the representing predicate of a function is expres¬ 
sible in the system, we can hope to be able to express and develop the 
theory of the function in the system much as though a term for the 
function itself were available. A metamathematical investigation to 
confirm this conjecture will be undertaken later (§ 74). 

We seek some information now (and in § 49) on the question for what 
functions the representing predicates are expressible, or briefly, what 
functions are ‘representable’. Thus far we have been talking only about 
the interpretation of the symbolism, but presently we will introduce a 
notion for the representation of functions analogous to ‘numeralwise 
expressibility’ for the expression of predicates. 

The necessary and sufficient condition that a predicate P{x v ..., x n , w) 
be the representing predicate of some (single-valued) function <p(x lt ... ,x n ) 
is that for each n-tuple x v ...,x n there exists a unique w such that 
P(x v ..., x n , w). When this condition holds, the function y(x lt ...,x n ) 
represented can be defined ‘descriptively’ from the predicate P(x v ..., 
x n , w) as the w such that P{x v ..., x n , w). 

We now introduce “3!xA(x)” under the usual stipulations on the letters 
(beginning § 40, and end § 33) as abbreviation for 3x[A(x) & Vy(A(y) ID 
x=y)] (read “there exists a unique x such that A(x)”). Then if 
P(x v ..., x n , w) is expressed by the formula P(x 1 ,..., x„, w), the con¬ 
dition that P(x v ..., x n , w) be a representing predicate is expressed by 
the formula Vx x ... Vx„3!wP(x 1 ,..., x n , w), or simply by 3!wP(x 1 , 
x n , w) when x v ..., x n have the generality interpretation (§ 32). 

If x, y and z are distinct variables, A(x) is a formula, t, r and s are terms, 
r are formulas not containing x free, y, z, r, s and t are free for x in A(x), 
z and x do not occur in t, and y and z do not occur free in A(x): 

*170. If T, A(t), A(x) b t=x with the free variables held constant for 
A(x), then T, A(t) b 3!xA(x). 

*171. b 3!x(t=x). 
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*172. A(r), A(s), 3!xA(x) b r=s. 

*173. r^s, A(r), 3!xA(x) [- ~iA(s). 

(Properties of 3!.) 

*174a. A(t) &Vz(z<t 3-iA(z)) b 3!y[A(y) &Vz(z<y 3 -iA(z))]. 
*174b. b 3y[A(y) & Vz(z<y 3-iA(z))] ~ 3!y[A(y) & Vz(z<y 3-iA(z))]. 
(Uniqueness of the least x such that A(x).) 

Proofs. *174a. Using *170, *139. Cf. the remark in the proof of *169. 

An intuitive number-theoretic function cp^, ..x„) is said to be 
numeralwise representable in the formal system, if there is a formula 
P(xj,..., x„, w) with no free variables other than the distinct variables 
x 1 ,..., x„, w such that, for each particular w-tuple of natural numbers 
x lt .. .,x n , 

(v) if <p(# lf = W, then b P(*i,..., x n , w), and 

(vi) b 3!wP(x lf ...,x n , w). 

In this case, the formula P(xj,..., x„, w) numeralwise represents the 
function <p(x v ..., x n ) (with the obvious correspondence of variables). 

Our finitary (i.e. intuitionistic) use of this notion will be confined to 
cases when there is a calculation procedure for the function <p(x 1( ..., x n ) 
(§ 30), so that the w of (v) can be found for any given x v .... x„ (or will 
make this tacitly an hypothesis). 

If <f>(x v ..., x n ) is numeralwise represented by P(x x ,..., x„, w), the 
latter numeralwise expresses the representing predicate P(x v ..x n ,w) 
of <p. For from (v) and (vi), we can infer that for every w, 

(vii) if y{x v ...,x n ) ^w, then b ”>P(*i,..., x„, w), 

as follows. Take the w of (v) as the r and the w of (vii) as the s for *173. 
Use *(164) to get the provability of the r^ts. 

We have no ground to believe that, conversely, (vii) for every w together 
with (v) necessarily implies (vi). 

We have used the 31-notation to state (vi) compactly. 3!xA(x) is 
equivalent to 3xA(x) & VxVy(A(x) & A(y) 3 x=y), in which the first 
part expresses existence and the second uniqueness. The existence part 
for (vi) follows already from (v); so what (vi) adds is the uniqueness. 

In numeralwise representability of functions (just as in numeralwise 
expressibility of predicates), we are limiting ourselves to the consideration 
of the values for particular arguments, in contrast to general properties. 
The questions considered are in this way analogous to computational 
questions in informal arithmetic. 

For example, we have not required in defining ‘numeralwise repre- 
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sentability’ that the formula 3!wP(x 1 > ..x n , w) should be provable 
with Xj,..., x„ as formal variables. That would be stronger than our 
demand that (vi) hold for every x lf ..., x n with x v ..., x n as intuitive 
variables, which would follow from it by substitution (*66 § 32 with 
as the t v ...,t n ). That stronger condition (without (v)) will 
be the prerequisite for the theory to be developed in § 74. For the six 
functions we consider now, we readily obtain that also. 

The functions 

*(175) a', *(176) a+b, *(177) ab 

are numeralwise represented by the respective formulas a’=b, a+b—c, ab—c. 

Proofs. *(176) By *171 and the form of the representing formula 
a+b—c, (vi) is immediate (and even p 3 lc(a+b—c)). For (v), we must 
show that for each pair a, b of natural numbers, if c = a+b, then 
P a+6= c. For example, if a = 2 and b — 3 (then c = 5), we have 
P a+6=c (i.e. p 0"+0"'=0""') as for (i) in Example 2. 

*(177) Similarly. The proof of (v) can be conveniently arranged as an 
intuitive induction on b. Ind. step. Say c = ab, d = ab' (= ab+a — 
c-\-a). Then p ab' = ab fa [Ax. 21] = c+a [hyp. ind., *104] = d 
[by (v) for *(176)]. 

Some general principles will illuminate our treatment of the next 
examples. A prime formula is one containing no logical symbols, i.e. 
here it is s=t for some terms s and t. 

(A) Each closed prime formula s=t is formally decidable (and s=t 
is provable or refutable according as the terms s and t express the same or 
different numbers under the usual interpretation of 0, ', + , •). Each prime 
formula is numeralwise decidable. 

Proof. Use *(176), *(177), Theorem 24 § 38, and *(164). 

Example 4. Let s=t be 0"'-0""+0'=(0"'-0")", i.e. abbreviated 
3-4+1 = (3-2)". Now h 3-4+1 — (3-2)" ~ 12+l = (3-2)” [since by *(177), 
h 3-4=12] ~ 13=(3-2)" [since by *(176), p 12+1 = 13] ~ 13=8 [since 
by *(177), P 3-2=6; and noting that 6",i.e. (0""")", is 8], But by *(164), 
p —• 13=8. Hence P -i3-4 + l = (3-2)”. We have used tacitly Theorem 
24 (b) or its Corollary 1, and *21 § 26, in concluding by the chain that 
P 3*4 + 1 = (3-2)" ~ 13=8; and *30 *18b, or *20 and Corollary Theorem 
6, in combining this with p —113=8 to infer that p -i3-4+l = (3-2)". 

(B) Let P(x x ,..., x n ) be a formula containing free only the distinct 
variables x!,...,x n , and suppose P(x 1 ,...,x n ) is numeralwise decidable 
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{and numeral-wise expresses P{x lt ..., x„)). Then : If t v ...,t n are terms 
containing no variables {and therefore expressing numbers t v ..t n ), 
is decidable {and P(t 1; ..., t„) is provable or refutable according 
as P{t v .. .,t n ) is true or false). If t v ..., t„ are terms free for x v ..., x„ 
in P(Xj,..x„), P(t lf ..., t„) is numeralwise decidable. 

Proved like (A), which is the special case of (B) for which P(x x ,.. ., x n ) 
is x x =x 2 . 

(C) A formula composed out of closed decidable formulas using only the 
operators D, &, V, -i of the propositional calculus is decidable {and whether 
it is provable or refutable can be determined by use of the classical 2-valued 
truth tables § 28 taking t and f as 'provable' and ‘refutable’, respectively). 

By Lemma 13 § 29 with Theorem 3 § 25. 

(D) Hence: Each formula without variables is decidable. Each formula 
without quantifiers is numeralwise decidable. 

(E) Let A(x x ,..., x n , y) be a numeralwise decidable formula containing 
free only the distinct variables x x ,...,x n ,y; and let z be a variable 
distinct from x x , ...,x n ,y. Then Vy(y<z D A(x x , ..., x„, y)) and 
3y(y<z & A(x x ,.. .,x n ,y)) are numeralwise decidable {and Vy(y <z 3 
A(x x , ..., x n , y)) is provable or refutable, according as all of A(x x ,..., x n , 0), 
A(x x ,..., x„, 1 ),..., A(x x ,..., x„, z—1) are provable or some are refutable ; 
3y(y<z & A(x x ,..x„, y)), according as some are provable or all are 
refutable). Similarly with <, in place of <. 

Proof (for <). Use *166, *167, *(165) (also *58b §27, *86 §35). 

Consider the division of two integers, a by b. For example, 13 = 5*2+3 
where 3 < 5. In words, when 13 is divided by 5, the quotient is 2 and the 
remainder is 3. Customarily, the division process, and therewith the 
quotient function [afb] and the remainder function rm(a, b) are defined 
only for fc + 0. To avoid the trouble of discussing partially defined func¬ 
tions now, we extend the definitions to the case b = 0 by setting 
[a/0] = 0, rm(a, 0) = a. This preserves the law a = b[ajb ] + rm(a, b). 
Then b\a {“b divides a”) if and only if rm(a, b) — 0. 

The functions 

*(178) [afb], *(179) rm {a,b) 

are numeralwise represented by respective formulas Q(a, b, q) and R {a, b,r) 

such that, for any numerals q and r : 

*178a. Q{a,b,q) b 3 \qQ{a,b,q). *179a. R(a, b, r) b 3!rR(a, b, r). 
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Proofs. *(179) and *179a. Let S (a,b,r) be the formula 
3q{q<a & a=bq-\-r & r<b) V {b= 0 & r=a). 

Then let R (a, b, r) be 

S(a, b, r) & Ve(e<r 3 “iS(a, b, e)). 

Now *179a is immediate from *174a (with r as the t). 

To establish (v) for *(179), consider any pair of numbers a and b, and 
let r = rm(a, b) and q = [a,b]. Case 1: b ^ 0. Now a = bq-\-r; hence 
by (A), (- a=6qr+r. Also r < b; hence by *(165), (- r<6. By &-introd. 
(or (C)), )- a=bq-\-r 8c r<b. But q < a\ so by (E), 
b 3q(<j< a & a=bq-\-r & r<b). By V-introd., 

b 3q{<j<a & a—bq-\-r & r<6) V (6=0 & r=a), i.e. b S(a, 6, r). Now 
let e be any number < r. Then e < b (so by *(165), b e<6). For any 
number p, a ^ bp+e (since q, r is the only pair of numbers with r <b 
such that a = bq-\-r ); so by (A), b ”ia = 6p+c. Thence by (C), 
b -i(a=6p+e &c<6). In particular this holds for p — 0, 1,..., a; 
so by (E), b “i3i7 (^<a& a=bq+e &e< 6). But b^ 0; hence by 
*(164), b "i6=0. By *(164), either b e=a or b ~i e—a. Combining 
these results by (C), 

b -i (3^(<7<a & a=bcj+e & e<b) V (6=0 & e=a)), i.e. b -iS(a, 6, e). 
This was for any e < r, i.e. it holds for e = 0, 1,.. r— 1; so by (E), 
b Vc(e<r 3-iS(a, 6, e)). From this and b S(a, 6, r) by &-introd., 
b S(a, 6, r) & Ve(e<r 3 -iS(a, 6, e)), i.e. l-Rfo, 6 ,r), as was to be 
shown. Case 2: b = 0. Similarly. (Summarizing; From the form of 
R(a, b, r), by (A), *(165), (C) and (E), R(a, b, r) is numeralwise decida¬ 
ble. By formal steps paralleling the interpretation, we verify that 
R(a, 6, r) is provable rather than refutable when r = rm(a, b).) 

Now (vi) for *(179) follows by substituting any numerals a, 6 for a, b 
in *179a, and using (v). 

The function 
*(180) rm(c, {i'-d)') 

is numeralwise represented by a formula B(c, d, i, w) such that, for any 
numeral w : 

*180a. B(c, d, i, w) b 3 !wB(c, d, i, w). 


Proof. Let “c”, “d”, “i”, "v”, "w” denote c, d, i, q, r, respectively; 
andlet B(c, d, i, w) be R(c, (i'-d)', w). Use *179a, substitution (*66), *(179) 
and (B). 


The foregoing treatment does not give the stronger property that 
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3!wP(x x ,..x n , w) for the formulas numeralwise representing 
[ajb], rm(a, b) and rm(c, (i'-d)'). However using *146a and *146b (with 
*123, *140, *141, *142a and *143a), this can be established also, and 
simpler representing formulas can be given equivalent to the former. 
*178b. \- Q (a, b, q) ~ 3 r{a—bq+r & r<b ) V b=q— 0. 

*179b. |- R (a,b,r) ~ 3q(a=bq-\-r & r<b) V (b=0 & r—a). 

*180b. h B(c, d, i, w) ~ 3v(c=(i'-d)'-v+w & w<(i'-d)'). 

*178c. h3 \qQ(a,b,q). *179c. h 3!rR(<*, b, r). 

*180c. h 3!wB(c, d, i, w). 

Lemma 18a. The results *(164) — *180c and (A) — (E) of this section, 
excepting * 169 and * 174a when t is not a numeral, * 174b, * 178b, c, * 179b, c 
and *180b, c, hold good for the formal system lacking Axiom Schema 13 but 
having as additional particular number-theoretic axioms the formulas of 
*104—*107 and of *137 or *136 (“Robinson’s system”). 

§ 42. Gddel’s theorem. From a result of Presburger 1930, meta- 
mathematical proofs of consistency and completeness, and a decision 
procedure, can be given for the formal system with the formation rule 
and axioms for • omitted. (Cf. Example 2 § 79. Presburger deals with 
a classical system of the arithmetic of the integers, but Hilbert and 
Bernays 1934 pp. 359 ff. adapt his method to essentially the present clas¬ 
sical system, and Joan Ross has verified that the adaptation works for 
the intuitionistic system as well.) 

For the full system (or systems essentially equivalent to it), these 
questions proved to be very refractory. Consistency proofs by Ackermann 
1924-5 and von Neumann 1927 lead to the result that the system is 
consistent under the restriction on the use of the induction postulate 
(Axiom Schema 13) to the case that the induction variable x does not 
occur free within the scope of a quantifier of the induction formula 
A(x). (Cf. Theorem 55 § 79. The restriction excludes e.g. our proofs of 
*105, *136 and *148.) 

This situation was illuminated in 1931 by the appearance of two 
remarkable theorems of Godel “on formally undecidable propositions 
of Principia Mathematica and related systems”. We designate the first 
of these theorems, which entails the other as corollary, as “Godel’s 
theorem”, although it is only one of a series of important contributions by 
its author. These two theorems, which became the most widely noted in the 
subject, bear on the whole program and philosophy of metamathematics. 

The metamathematical results presented thus far in this book were 
reached along paths more or less suggested by the interpretation of the 
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system. These results of Godel are obtained by a kind of metamathemati- 
cal reasoning which goes more deeply into the structure of the formal 
system as a system of objects. 

As is set forth in § 16, the objects of the formal system which we study 
are various formal symbols, formal expressions (i.e. finite sequences of 
formal symbols), and finite sequences of formal expressions. There are an 
enumerable infinity of formal symbols given at the outset. Hence, by the 
methods of § 1, the formal objects form an enumerable class. By spec¬ 
ifying a particular enumeration of them, and letting our metamathemat- 
ical statements refer to the indices in the enumeration instead of to the 
objects enumerated, metamathematics becomes a branch of number 
theory. Therewith, the possibility appears that the formal system 
should contain formulas which, when considered in the light of the 
enumeration, express propositions of its own metamathematics. 

It will appear, on further study, that this possibility can be exploited, 
and with the use of Cantor’s diagonal method (§2), a closed formula A 
can be found which, interpreted by a person who knows this enumeration, 
asserts its own unprovability. 

This formula A bears an analogy to the proposition of the Epimenides 
paradox (§ 11). But now there is a way of escape from the paradox. By 
the construction of A, 

(1) A means that A is unprovable. 

Let us assume, as we hope is the case, that formulas which express false 
propositions are unprovable in the system, i.e. 

(2) false formulas are unprovable. 

Now the formula A cannot be false, because by (1) that would mean 
that it is not unprovable, contradicting (2). But A can be true, provided 
it is unprovable. Indeed this must be the case. For assuming that A is 
provable, by (1) A is false, and hence by (2) unprovable. By (intuitive) 
reductio ad absurdum, this gives that A is unprovable, whereupon by 
(1) also A is true. Thus the system is incomplete in the sense that it fails 
to afford a proof of every formula which is true under the interpretation 
(if (2) is so, or if at least the particular formula A is unprovable if false). 

The negation -iA of the formula is also unprovable. For A is true; 
hence -iA is false; and by (2), -iA is unprovable. So the system is in¬ 
complete also in the simple sense defined metamathematically in the last 
section (if (2) is so, or if at least the particular formulas A and -iA are 
each unprovable if false). 

The above is of course only a preliminary heuristic account of Godel’s 
reasoning. Because of the nature of this intuitive argument, which skirts 
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so close to and yet misses a paradox, it is important that the strictly 
finitary metamathematical proof of Godel's theorem should be appreciat¬ 
ed. When this metamathematical proof is examined in full detail, it is 
seen to be of the nature of ordinary mathematics. In fact, if we chose 
to make our metamathematics a part of number theory (now informal 
rather than formal number theory) by talking about the indices in the 
enumeration, and if we ignore the interpretations of the object system 
(now a system of numbers), the theorem becomes a proposition of or¬ 
dinary elementary number theory. Its proof, while exceedingly long and 
tedious in these terms, is not open to any objection which would not equally 
involve parts of traditional mathematics which have been held most secure. 

We can give the rigorous metamathematical proof now, by borrowing 
one lemma from results of the next two chapters. Our numbering of the 
lemmas and theorems corresponds to the logical order. 

In making use of the idea of enumerating the formal objects, practical 
considerations dictate that the indices of formal objects should be 
correlated to the objects by as simple a rule as possible. We can modify 
the above heuristic argument (inessentially) by using, rather than an 
enumeration in the usual sense, an enumeration with gaps in the natural 
numbers, i.e. a correlation of distinct natural numbers to the distinct 
formal objects, not all of the natural numbers being used in the cor¬ 
relation. We call this a Godel numbering, and the correlated number of 
a formal object its Godel number. (Sometimes separate Godel numberings 
are given of the formal symbols, of the formal expressions, and of the 
finite sequences of formal expressions. If that is done, then when one 
speaks of a number as the Godel number of a symbol, or of an expression, 
or of a sequence of expressions, in each case a different correlation is being 
referred to.) 

Relative to any specified Godel numbering, for any n which is the Godel 
number of a formula, let “A„” designate the formula. (For other n’s, 
we need not define A„.) We may write this formula A„ also as “A n (a)”, 
showing the free variable a for use with our substitution notation (§ 18). 

Lemma 21. There is a Godel numbering of the formal objects such that 
the predicates A (a, b) and B(a, c) defined as follows are numeralwise ex¬ 
pressible (§41) in the formal system. 

A(a, b): a is the Godel number of a formula (namely A a (a)), and b is the 
Godel number of a proof of the formula A a (a). 

B(a, c): a is the Godel number of a formula (namely A a {a)), and c is the 
Godel number of a proof of the formula -iA„(a). 
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Now let A (a, b) and B (a, c) be particular formulas which numeralwise 
express the predicates A{a,b) and B{a,c), respectively, for the Godel 
numbering given by the lemma. The two formulas A {a, b ) and B (a, c ) 
could actually be exhibited, after we have the proof of the lemma (to be 
completed in § 52). 

Consider the formula Vb -iA (a, b) which contains a and no other 
variable free. This formula has a Godel number, call it p, and is then the 
same as the formula which we have designated “A v {a)”. Now consider 
the formula A P (p), i.e. 

A „(p): Vb-\A(p,b), 

which contains no variable free. Note that we have used Cantor’s diagonal 
method in substituting the numeral p for a in A v (a) to obtain this formula. 

To relate this to the preliminary heuristic outline, we can interpret 
the formula A „{p) from our perspective of the Godel numbering as 
expressing the proposition that A P (p) is unprovable, i.e. it is a formula 
A which asserts its own unprovability. 

In the metamathematical argument, the assumptions of the heuristic 
argument that the system should not allow the proof of either of the 
formulas A or -i A if false will be replaced by metamathematical equiva¬ 
lents. For the unprovability of A if false, this equivalent will be the (simple) 
consistency of the system (§28). For the unprovability of -iA if false, 
we shall need a stronger condition called ‘co-consistency’ which we shall 
now define. 

The formal system (or a system with similar formation rules) is said 
to be co -consistent, if for no variable x and formula A(x) are all of the 
following true: 

b A(0), t-A(l), b A(2), ...; b iVxA(x) 

(or in other words if not both b A (n) for every natural number n and 
b -iVxA(x)). In the contrary case that for some x and A(x) all of A(0), 
A(l), A(2), ... and also -iVxA(x) are provable, the system is co -in¬ 
consistent. 

Note that co-consistency implies simple consistency. For if A be any 
provable formula containing no free variables, writing it as “A(x)” 
where x is a variable, all of A(0), A(l), A(2), ... are provable (under our 
substitution notation § 18, eaeh of these is simply A itself); and hence 
if the system is co-consistent, -iVxA(x) is an example of an unprovable 
formula (cf. §28). 

Theorem 28. If the number-theoretic formal system is {simply) consis¬ 
tent, then not \- A v (P) ; and if the system is co -consistent, then not b ~i A, t (p) . 
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Thus, if the system is <o- consistent, then it is {simply) incomplete, with 
Aj ,(p) as an example of an undecidable formula. (Godel’s theorem, in the 
original form.) 

Proof that, if the system is consistent, then not b A„(p). Suppose 
(for intuitive reductio ad absurdum) that b A„(p), i.e. suppose that 
A P (p) is provable. Then there is a proof of it, let the Godel number of 
this proof be k. Then A{p, k) is true. Hence, since A {a, h) was introduced 
under the lemma as a formula which numeralwise expresses A (a, b), 
b A(p, ft). By 3-introd., b 3/»A(p, b). Thence by *83a, b -iA(p, h). 
But this is b ~iA„ Cp)- This, with our assumption that b A „(p), con¬ 
tradicts the hypothesis that the system is consistent. Therefore by re¬ 
ductio ad absurdum, not b A ,(p). as was to be shown. (We could also 
have contradicted the consistency by using V-elim. to infer b ~iA(p, ft) 
from b A„(p).) 

Proof that, if the system is co-consistent (and hence also consistent), 
then not b “»A P (p). By the consistency and the first part of the theorem, 
A s (p) is not provable. Hence each of the natural numbers 0, 1, 2, ... is 
not the Godel number of a proof of A P (p); i.e. A(p, 0), A{p, 1), A{p, 2), 
... are all false. Hence, since A {a, h) numeralwise expresses A{a,b), 

b -iA(p, 0), b “i A (p, 1), b ~iA(p, 2),_By the ^-consistency, then 

not b “i 'ib -i A (p, b). But this is not b “i A „(p), which was to be shown. 

We have given the original Godel form of the theorem first, as the 
proof is intuitively simpler and follows the heuristic outline. Rosser 1936 
has shown, however, that by using a slightly more complicated example 
of an undecidable formula, the hypothesis of ^-consistency can be dis¬ 
pensed with, and the incompleteness proved from the (simple) con¬ 
sistency alone. Consider the formula V/>[-iA(a, b) V 3 c(c < b & B {a, c))]. 
This has a Godel number, call it q. Now consider the formula A,(q), i e. 

A Q {q): Vb [-1A (q, b) V 3c(c < b & B(g, c))]. 

We can interpret the formula A„(g) from our perspective of the Godel 
numbering as asserting that to any proof of A„(qr) there exists a proof of 
">A q (q) with an equal or smaller Godel number, which under the hy¬ 
pothesis of simple consistency implies that A 9 {q) is unprovable. 

Theorem 29. If the number-theoretic formal system is [simply) con¬ 
sistent, then neither b A M nor b “i A Q {q)\ i.e. if the system is consistent, 
then it is {simply) incomplete, with A Q {q) as an undecidable formula. 
(Rosser’s form of Godel’s theorem.) 
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Proof that, if the system is consistent, then not b A Q (q). Suppose 
that b A a (g). As before (using q instead of p), b A(9, ft). Also, under our 
hypothesis of consistency, the assumption that b A Q (q) implies that 
not b “i A„(<j), i.e. -iA Q (q) is unprovable. Hence, in particular, each of 
B(q, 0), B(q, 1), .... B(q, k) is false. Since B (a, b) numeralwise expresses 
B(a,b), therefore b “iB(g, 0), b “«B(g, 1 ), ..., Hence 

by *166a, b Vc(c^ft D -iB(q, c)). This with b A (q, ft) gives by &- and 
3-introd., b 3i»[A(g, b) & 'ic(c<,b D -1 B(g, c))]. Thence by *58b and 
*86, b 3b[A(q, b) & -\lc{c<>b & B(q, c))]. Thence by *57b (and *70), 
b 3/>-i [-iA(g, b) V 3c(c<b & B(q, c))]. Thence by *85a, 
b —tVi»[—1 A(qr, b) V 3 c{c<,b & B (q, c))]. But this is b “iA „{q). Hence as 
before, not b K(q), as was to be shown. 

Proof that, if the system is consistent, then not b A,(g). Suppose 
that b “»A q (q), i.e. that -iA „(q) is provable. Then there is a proof of it; 
let the Godel number of this proof be k. Then B(q, k) is true. Hence 
b B(q,k). By *168, b 'ib[b'^.k D 3c(c <,b&B(q, c))]. Also as before 
(with q instead of />), b ~iA(qf, 0 ), b “iA (q, 1), ..., b “iA(g, k— 1). 
By *166, b Vb[b<k 3 ~'A(q, b)]. Now by *169, b V^[-iA(q, b) V 
3c(c<,b & B(^r, c))]. But this is b A a (q), Hence not b ~«A Q (q), as was 
to be shown. 

Observe that we have not shown outright that A„(p), -iA,,(p), A,(g), 
-i A„(g) are unprovable, but only that if the system is (simply) consistent, 
A„(p), A q (q), ->A q (q) are unprovable, and if the system is co-consistent, 
“iAj,(p) is unprovable. 

Consider our demonstration that A„(p) is unprovable, if the system 
is consistent. If a demonstration of the consistency of the system were 
now supplied, prefixing it to the former would complete a demonstration 
that Aj ,(p) is unprovable. 

Supposing such a demonstration that A„(p) is unprovable to exist, we 
could, using the representation of the formal objects by Godel numbers, 
express it as a demonstration in informal number theory. We now ask 
whether the latter demonstration could be formalized in the system. 

In formalizing it, the formula A„(p) would itself be the formalized 
statement of what is demonstrated, i.e. that A P (p) is unprovable. Thus 
a formalized demonstration that A„(p) is unprovable would be a formal 
proof of Aj,(p). By Theorem 28, such a proof cannot exist if the system is 
consistent. 

Thus, if we had an informal demonstration that A„(p) is unprovable, 
the demonstration would be incapable of being formalized within the 
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system, if the system is consistent. The supposed informal demonstration 
was to consist of two parts, first a supposed proof of the consistency of 
the system, and second the proof we have already given (for the first half 
of Theorem 28) that A„(p) is unprovable if the system is consistent. 

By showing that the second part actually can be formalized in the 
system, we have a method of showing that the first cannot be if the 
system is consistent. This gives the next theorem. We shall recapitulate 
this argument before stating the theorem. 

The assertion that the system is (simply) consistent can be expressed 
in the formal system via the Godel numbering variously. Let C(a, b) be 
(D(a, c) be) the predicate: a is the Godel number of a formula, namely 
A OJ and b is (c is) the Godel number of a proof of A a (of nA 0 ). There 
are formulas C(a, b) and D [a, c) expressing C[a, b) and D[a, c), respec¬ 
tively (§ 52). The original definition of consistency in § 28 is then rendered 
directly into the formalism by the formula -\3a[3bC(a, b) & 3cD(a, c)]. 
By the second version of the definition, and the fact that —11 =0 is 
provable (Ax. 15), the system is consistent, if and only if the particular 
formula 1 = 0 is unprovable. Let r be the Godel number of this formula. 
Then A r (r) is the same formula, and consistency is expressed by 
—i3bA(r,b) or ib — iA(r, b). Let us call one of these formulas, at our 
preference, ‘‘Consis”. 

The assertion that A a ,(p) is unprovable is expressed, via the Godel 
numbering, by 'ib —iA( p, b), which is A v [p). 

The intuitive demonstration of the first half of Theorem 28 is a dem¬ 
onstration that 

(I) {the system is consistent} implies {A v (p) is unprovable}. 

It is now proposed that the entire metamathematical demonstration of 

(I) should be formalized in the system, using the Godel numbering, so 
that we should then have 

(II) |- Consis D A„(p). 

Now assume metamathematically that h Consis. Then from (II) we 
should have by D-elimination, \- A t ,(p). By Theorem 28, this is im¬ 
possible, if the system is consistent. By metamathematical reductio ad 
absurdum, this would give the following theorem. (The proof could 
also be based on Theorem 29, since 1- ib -iA (q, b) 3 A Q (q).) 

Theorem 30. If the number-theoretic formal system is [simply) con¬ 
sistent, then not 1- Consis; i.e. if the system is consistent, then there is no 
consistency proof for it by methods formalizable in the system. (Godel’s 
second theorem.) 
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The proofs of Theorems 28 and 29 will be complete when we have 
established Lemma 21 in Chapter X. For the proof of Theorem 30, there 
will remain the gap to be filled in passing from (I) to (II). This is an 
exercise in formalizing an informally given proof of considerable length, 
which we shall not take the space to carry through in this book. 

Hilbert and Bernays 1939 carry it out for a certain formal system Z^ 
(cf. pp. 283 ff., especially pp. 306—324), and thence infer (pp. 324—328) 
that the theorem holds also for a system Z which differs from our classical 
number-theoretic system in inessential respects (mainly, in the use of 
predicate variables, cf. end § 37, and in the equality postulates, cf. § 73). 
Thence Theorem 30 holds for our classical system; and it can be inferred 
to hold for our intuitionistic system (at least when Consis is V/> -1 A(r, b) 
or -i3bA(r, b), and A (a,b) is suitably chosen) by use of Godel 1932-3 
(cf. Theorem 60 (b2) § 81 and Nelson 1947 pp. 326—327). 

It may be remarked that this exercise is required only for the sake of 
using as the formula Consis a direct formalization of the original def¬ 
inition of (simple) consistency or a close equivalent. Intuitively A ,(p) 
itself expresses an equivalent, via the long intuitive proof of Godel’s 
theorem. Fop by Theorem 28, if the system is consistent, A P (p) is un- 
pro vable, and by § 28, if A „(p) is unprovable, the system is consistent; 
and the unprovability of A „(p) is expressed by A „(p). 

What is the significance of these results for the proposed program 
of metamathematics? We certainly hope that the formal system is 
consistent. If so, then by Theorem 29 it is necessarily incomplete. We 
have not succeeded in formalizing informal number theory completely 
explicitly, so that each proposition or its negation is a consequence by 
explicitly stated rules of explicitly stated axioms (§ 15). 

Taking the supposition of (simple) consistency in its finitary meaning, 
i.e. if the consistency is capable of being proved metamathemati- 
cally, then the system is incomplete also in the sense that there are 
expressible in it propositions true on finitary grounds, but unprovable 
formally, three such propositions being expressed by A „(p), A,(qr) and 
Consis. 

For the problem of proving consistency metamathematicallv, Theorem 
30 has the consequence that the methods which we must trust in the 
proof must include some which lie outside the collection of the methods 
formalized in the system. This is not a priori incompatible with the other 
requirement that the methods not include all which lie inside, our mis¬ 
trust of some of which was the occasion for the formalist project of at¬ 
tempting the proof. It does pose the challenge to the metamathematician 
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to bring to bear methods of finitary proof more powerful than those 
commonly used in elementary number theory. 

There is a further implication for the completeness and consistency 
problems. Suppose our system is (simply) consistent. Then, as in the 
proof of the second half of Theorem 28, A„(p) is unprovable, but 

r -iA(p, 0), b -iA(p, 1), b - 'A(p, 2). Thus we have a formula 

A(x) (namely -iA(p, b)) such that b A(0), b A(l), b A(2), ..., but not 
b VxA(x) (which is A v (p)). Tarski 1933a gives the name w -incompleteness 
to this situation. If in this situation b -iVxA(x) (which is -iA„(p)), the 
system would be co-inconsistent. The discovery that a system may be 
o-incomplete reveals the possibility that it may be w-inconsistent without 
being simply inconsistent. The system obtained from our system by taking 
-iAj,( p) as a new axiom definitely is, under the assumption that ours 
is simply consistent. To see that this system is simply consistent, we 
observe that if B and -iB were provable in it, then in the original system 
B and uB would be deducible from —1 A„(p) (end §20); hence by 
-i-introd., -1-1 A„(p) would be provable; and by -i-elim. (or intuitionisti- 
cally by IVa 2 D IVaj from Corollary Theorem 17 §35), A„(p) would be 
provable, contradicting Theorem 28. There are evidently definable still 
higher orders of completeness and consistency. 

We don’t want our system to be w-inconsistent, even if it is consistent. 
In particular, if the simple consistency were provable metamathematical- 
ly, then the formula ~iA P (p) would under the interpretation express a 
proposition contradicting one that is true on finitary grounds; and in 
case -iAp(p) were provable, following Hilbert and Bernays (1939 p. 282) 
we should call the system externally inconsistent, i.e. inconsistent with 
respect to the finitary interpretation. Thus a proof of simple consistency 
alone would not secure the formalized mathematics against the possi¬ 
bility of establishing something intuitively false. (This was first noted 
by Finsler 1926 in connection with a system that is not formal in our 
sense. Also cf. Godel ig3i-2a.) 

Suppose our system is simply consistent. For brevity, call a formula 
"true” ("false”), if it expresses a true (false) proposition under the inter¬ 
pretation. Let B(x) be a formula which numeralwise expresses a pred¬ 
icate B(x), which it shall express under the interpretation. Then for each 
natural number x, the formula B(x) is provable if and only if it is true. 
The formula VxB(x) is true if it is provable (in view of V-elim.); but not 
in general conversely (e.g. A„(p)). The formula 3xB(x) is provable if it 
is true (in view of 3-introd.); but we have not shown the converse meta- 
mathematically (e.g. 3bA(p, b), from which —iA v (p) follows by *83a, 
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and which we thus know to be false, but do not know without assuming 
also co-consistency to be unprovable). 

Thus (cf. § 14) a proof of (simple) consistency would be sufficient to 
justify the use of our classical system for proving “real” statements 
intuitionistically by an excursion through the “ideal”, for “real” state¬ 
ments of the forms B(x) and VxB(x), but not so far as we have yet shown 
for those of the form 3xB(x). 

Suppose that we add to the system A „(p), A Q (q) or Consis as a new 
axiom, and iterate the whole process, to obtain a succession of systems. 
It can be shown that if these systems are consistent, then the class of 
provable formulas of the form VxB(x) is successively enlarged. Godel 
1931-2 states that the same is true of the systems obtained by admitting 
successively higher types of variables (at least, supposing ^-consistency). 
Except that we lack proofs of the appropriate consistency properties, 
this shows (cf. § 14) that successively higher theoretical constructions do 
add to the class of “real” statements of the original sort which are 
comprised. Godel 1936 states also that in the higher systems infinitely 
many of the previously provable formulas have very much shorter proofs. 

Thus far we have Godel’s theorem only for our particular formal 
system (except for the last remarks, whichreferto a succession of systems). 
The question arises now whether it may not depend on some peculiarities 
of the present formalization of logic, and might be avoided in some other. 
In the next chapters, besides completing the proof of the required lemma 
for Godel’s theorem, we shall reach a standpoint from which we can discuss 
these questions for formal systems in general, with the formal system 
studied here as an example (§§ 60, 61). 
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PRIMITIVE RECURSIVE FUNCTIONS 

§ 43. Primitive recursive functions. To establish the lemma for 
Godel’s theorem, we shall develop an intuitive theory about a certain 
class of number-theoretic functions and predicates, eventually showing 
that every predicate of the class is numeralwise expressible in the formal 
system (§ 49), and that the two predicates A (a, b) and B(a, c) of the lemma 
belong to the class (§ 52). This will save us much of the labor of a step by 
step development within the formal system. 

Except for the application just described, the theory of these functions 
and predicates will be developed independently of the formal system of 
the preceding chapters. In this theory, as in metamathematics, we shall 
use only finitary methods. 

The series of the natural numbers 

0, O', 0", O'", .... 

or 0, 1, 2, 3, .... we described as the class of the objects generated from 
one primitive object 0 by means of one primitive operation ' or -f 1. 
This constitutes an inductive definition of the class of the natural numbers 
(§ 6 ). 

Proof by induction, as a method of proving a theorem T(y) for all nat¬ 
ural numbers y, corresponds immediately to this mode of generating the 
numbers (§ 7). Definition by induction (not to be confused with ‘inductive 
definition’, §§ 6, 53), also called recursive definition, is the analogous 
method of defining a number-theoretic function <p(y) or predicate P(y). 
First <p(0) or P(0) (the value of the function or predicate for 0 as argument) 
is given. Then, for any natural number y, 9 (y') or P(y') (the next value 
after that for y) is expressed in terms of y and 9 (y) or P(y) (the value for y). 
Analogously, we can conclude that under these circumstances the value 
9 (y) or P(y) of the function or predicate is defined for every natural 
number y. For the two parts of the definition enable us, as we generate 
any natural number y, at the same time to determine the value 9 (y) or 
P(y). 
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To examine this in more detail, let us write the pair of equations 


( 1 ) 


[ ?(0) = q, 

\ ?(y') = x(y> <p(y))> 


to express the definition of a function y(y) by induction on y, where q 
is a given natural number, and yjy, z) is a given number-theoretic function 
of two variables. 


Then for example, the value <p( 4 ) is determined thus. To generate 4 , 
we generate successively 0, 1 , 2 , 3 , 4 . By the first equation, the value 
9(0) shall be the given number q; then by the second equation, the value 
<p(l) shall be y( 0 , cp(0)), i.e. (using the value cp(0) already found) x( 0 , q), 
which (since y_(y, z ) is a given function) is a given number; again the value 
9(2) shall be x(l. 9O)). ie - x0> x(°> ?))l the value 9(3) shall be x(2, 9(2)), 
i.e. x( 2 , x 0 > x( 0 . q)))l and finally the value 9(4) shall be x( 3 , 9(3)), i.e. 
x( 3 . x( 2 » x(l» X(°. ?))))• 

Thus we have a process by which, to each natural number y, on the 
basis of the generation of y in the natural number sequence, a corre¬ 
sponding number 9(y) is determined. Since a number <p(y ) is thus associat¬ 
ed with y, for each y, a particular number-theoretic function 9 is defined 
with these numbers 9(y) as its respective values. 

This function 9 satisfies the equations (1), when (1) are considered as 
functional equations in an unknown function 9, since every particular 
equation comprised in (1) (namely, 9(0) =q, 9(0')=x(0, 9(0)), 9(1') = 
x(l, 9(1))» • • •) is satisfied in the course of selecting the successive num¬ 
bers 9(0), 9(1), 9(2), .... Also this 9 is the only function satisfying 
(1) as functional equations, since the process by which we determined the 
successive numbers 9(0), 9(1), 9(2), ... from the equations (1) can be 
interpreted as showing that any function 9 satisfying the equations must 
have the values selected. 


In other definitions by induction, the function 9 defined depends on 
additional variables x 2 > ..x n , called parameters, which have fixed values 
throughout the induction on y. 


Example 1 . Consider intuitively the equations 

f 0+0 = a, 
l a+b' = («+&)', 

which we encountered in the formal symbolism as Axioms 18 and 19 . 
These define the function a+6 by induction on b, with a as parameter, 
and ' as a previously known function. Then the equations 

| a -0 = 0, 

\ a-b' = (<*•&)+<* 





define a-b by induction cn b, with a-\-b as a known function; and 

f «» = 1, 

\ a b ' = a b -a 

define a b by induction on b, with a-b as a known function. 

An example of a definition of a predicate by induction will be given 
later (Example 2 § 45 ). 

What number-theoretic functions are definable by induction ? To make 
this question precise, we must specify what functions are to be taken 
as known initially, and what operations, including what forms of definition 
by induction, are to be allowed in defining further functions. 

We shall now select the specifications with a view to obtaining functions 
definable by induction in an elementary manner. These functions will be 
called ‘primitive recursive’. 

Each of the following equations and systems of equations (I)—(V) 
defines a number-theoretic function <p, when n and m are positive integers, 
i is an integer such that 1 < i <; n, q is a natural number, and tj', Xi> • • • > 
y m , i are given number-theoretic functions of the indicated numbers of 
variables. 


(I) 

(II) 
(HI) 

(IV) 

(Va) 

(Vb) 


<f{x) = 

<p(*j,. ...x n ) = q. 

?(*!,•• •>*») = X 0 

?("^1> • • • > X-n) ^ ^(Xl^l* • • • > X n ),. . ., X*n(^l' • • • < X n ))> 

| cp(O) = q, 
l ?(y') = x(y> <p(y))- 

9(0, X 2 ,. . ., X n ) ^ ^(%2> • • • > ^n)> 

9 {y'< Xi,..., x n ) = x(y, 9(y> x 2 ,..., x„), x^,..., *„). 


((Va) constitutes the case of (V) for n = 1, and (Vb) for n > 1.) 

A function is primitive recursive, if it is definable by a series of ap¬ 
plications of these five operations of definition. 

This definition can be given in more detail, analogously to the def¬ 
inition of provable formula for the formal system (§ 19 ), say using the 
second version, as follows. 

We refer to the above equations and equation pairs (I)—(V) as sche¬ 
mata. They are analogous to the postulates, with (I)—(III) in the role of 
axiom schemata (or more strictly, (I) to a particular axiom), and (IV) and 
(V) in the role of rules of inference. 

A function 9 is called an initial function, if 9 satisfies Equation (I), 
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or Equation (II) for a particular n and q, or Equation (III) for a particular 
n and i. 

A function 9 is called an immediate dependent of other functions, if 9 
satisfies Equation (IV) for a particular n and m with <];, Xi* ..., Xm as the 
other functions, or Equations (Va) for a particular q with y as the other 
function, or Equations (Vb) for a particular n with x as the other func¬ 
tions. 

A function 9 is called primitive recursive, if there is a finite sequence 
9i,...,9 fc (&^1) of (occurrences of) functions (called a primitive re¬ 
cursive description of 9), such that each function of the sequence is either 
an initial function, or an immediate dependent of preceding functions 
of the sequence, and the last function 9* is the function 9. 

§ 44 . Explicit definition. The first problem of this chapter is to 
recognize as primitive recursive various functions which may already be 
known to us in other ways. (Analogously, in studying the formal system, 
we deduced from the axioms further formal theorems, and derived rules 
as general methods for finding still others.) 

The schemata have been given stereotyped forms to simplify the def¬ 
inition of the class of primitive recursive functions. In the remainder of 
this section, we shall learn to telescope several applications of them. 

Schema (I) gives the successor function as one of the initial functions. 
In this connection, we designate it as S. The initial functions given by 
Schema (II) we call the constant functions, and we designate them as C”. 
The initial functions given by Schema (III) we call the identity functions, 
and we designate them as £ 7 ”. 

Schema (IV) we call the schema of definition by substitution. The expres¬ 
sion for the ambiguous value of 9 is obtained by substituting expressions 
for the ambiguous values of Xi.• • •» Xm for the variables of 4 1 (cf. § 10). 
The function 9 defined by an application of this schema we sometimes 
write as S m(+. Xl.;--.Xm)- 

An explicit definition of a function consists in giving an expression for 
its ambiguous value constructed syntactically from its independent 
variables (with no other variables occurring free) and symbols for given 
functions, constants, operators, etc. In particular, we say that a 
function 9 is definable explicitly from (or is explicit in) functions 
and constants q lt ...,q„ if an expression for its ambiguous 
value <p(x v ..., x n ) can be given in terms of the variables x v . .., x n , the 
constants q x ,...,q, and the functions (cf. Example 2 § 10 ). 

In this case 9 can be obtained from ^1,..., <p t by a series of applications 
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of Schemata (II) — (IV). For Schemata (II) and (III) introduce each con¬ 
stant and each of the variables x n as a function of all the variables 

x v ...,x n ] and then the substitutions used in building up the expression 
for the ambiguous value (p(x lt ..., x n ) all fit the standard form (IV). 

Example 1 . Consider the explicit definition 

(a) cp(x, z, y) = £(*, ij(y, 6 (x)), 2 ). 

Considering x, y and 2 on the right as each a function of x, z, y, 
y(x, z, y) = £( U\{x, z, y), r^U^x, z, y), 0(Uf(x, z, y))), C\{x, z, y)). 

Thence we see that the following series of applications of Schemata 
(II) — (IV) can be used to define 9 from r\, 0. The successive functions 
used are named or defined at the left; and the applications of the schemata 
are analyzed at the right. For example, at Step 5, Schema (IV) is applied 
with n = 3 and m = 1 and with the preceding functions of Steps 3 and 4 
as the and y of (IV). 

1 . X, — first given function. 

2 . 7 ) — second given function. 

3. 6 — third given function. 

4. U\(x, z,y) — x — (HI), n — 3, i — 1 . 

5. 0 x (x, z, y) = 6(U?(x, z, y)) — (IV), n = 3, m = 1 ; 3, 4. 

6 . U\{x, z,y) — y — (HI), n — 3, i = 3. 

7. <H*, z, y) = 7 iU\[x, z, y), ^(x, z, y)) - (IV), n = 3, m = 2; 2, 6 , 5. 

8 . Cl(x, z, y) = 2 — (II), n = 3, q = 2. 

9. 9 (x,z,y) =X(U\{x,z,y), <] >(x,z,y), C\{x,z,y )) — (IV), n = 3, 

m = 3; 1, 4, 7, 8 . 

Note that this definition of 9 from £, 7 ), 0 can be expressed symbolically 
thus, 

(b) <? = SKC C/?,S 2 3 (t), C/|, Sf(0, t/?)),C 2 3 ). 

If C, 75 , 0 are primitive recursive, then so is 9 ; and a primitive recursive 
description 9 1( ..., 9 * of 9 is then ..., £,..., 7 ),..., 0, U\, Oj, U\, C\, 9 

where ..., £; ..., 7 ); ..., 0 are descriptions of X, tj, 0, respectively. 

This use of the identity functions C7“ in the analysis of explicit definition 
is due to Godel 1934 . 

Schema (V) is the schema of primitive recursion, without parameters 
(Va) or with parameters (Vb). We sometimes write the function 9 so 
defined as R*(x) (for (Va)) or R n (^, x) (f° r (Vb)). However in speaking of 
a “primitive recursion”, we shall now understand that the application of 
(V) may have lumped with it some steps of explicit definition. 
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Example 2. To analyze the primitive recursion for a-\-b (Example 
1 § 43 ), first let us restate it writing <p(b, a) for a+b, 

I 9(0, a) = a [= £/}(«)], 

l *(&',«)-(9(6. a))' [=x(*. 9 (M.«). 

if X (b,c, a) = c' = S(Ul(b,c,a))]. 

This fits Schema (Vb) when the right members are expressed as shown 
in brackets. So we accomplish the definition thus: 


1. S(a) = a’ — (I). 

2. U\(a) = a — (III), n = 1, i = 1. 

3. U\{b, c, a) = c — (III), n = 3, i = 2. 

4. x( 6 > a ) = S(U%(b, c, a)) — (IV), » = 3, m — 1; 1,3. 

5 - \ 1 = U }b {a \b a) a\ ~ (Vb)> n = 2: 2> 4< 

l <?(o » «) = x(o> <P(®> a )> a ) 


This shows that a+b considered as 9 {b, a) (i.e. 9 = \baa+b, cf. 
Example 3 § 10) is primitive recursive, with S, U\, U\, x, 9 as a primitive 
recursive description. We can obtain a-\-b as 9 1 (a, b) (i.e. 9 X — lab a+d) 
by three more steps. Symbolically, 

Xfcaa+&= R 2 (C/}, S?(S, U\)), 
laba+b= S|(R 2 (C/J, S?(S, U\)), U\,U\). 

This illustrates the general method. By the commutative property of 
a-\-b, 9!(a, b) = 9(a, b), so the last three steps could be omitted here; 
but not e.g. in treating the recursion for a 6 . 

We now use these techniques to establish the primitive recursiveness 
of a series of functions. Each of the functions listed below at the left 
is primitive recursive. To verify this, the reader may recognize, first, 
that the explicit definitions and primitive recursions exhibited at the 
right do generate primitive recursive functions, and second that the 
functions generated are the same as those defined or named at the left. 

#1. a+ft. 

#2. a-b. 

# 3 . a b > (also written: 

a exp b). 

# 4 . a!. 


( a-j-0 = a, 

1 a+b' = (a-f-5)', 
I a-0 = 0, 

\ a-b' = a-b+a. 

\ a 6 ' = a b -a. 

f 0! — 1, 

l a'! = a!-a'. 
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predecessor 
pd(a) = - of a if a > 0, 

0 if a = 0. 

, _ f a—b iia >b, j a —0 = a, 
| 0 if a < b. \ a—b' — pc 


pd(0) -0, 
pd(a') = a. 


min (a,b). 
min (a lt ...,a n ). 

raa x(a,b). 
maxfflj,., a n ). 

, f 1 if a = 0, 

Sg(fl) “{ 0 if a > 0. 

, . f 0 if d = 0, 

S * ( ‘> ={ 1 if a > 0. 

I o-b I- 

rm(a,6) (cf. § 41 ). 

[ajb]. 


[ a—b' — pd (a—b). 
min(a,6) = b—(b—a). 
min(a 1 ,... ,a n ) = 
min(.. .min(min [a v a 2 ),a 3 ). 
ma x(a,b) — (a-\-b)— min(«,6). 

Similarly to # 7 a. 

sg(«) = l-«» nr f sg(0) = 1, 

or sg(fl) = 0°, \ sg(a') = 0. 

sg(«) = sg(sg(«)), or f sg(0) = 0, 
orsg(a) = min (a, 1), \ sg («') = 1. 

| a—fe | = (a— b) + (b— a), 
f rm(0,6) = 0, 

\ rm (a',b) = (rm(a,&))'-sg \b— (rm(a,6))' | 

r [0/6] = o, 

l [«7^] = [«/&] + s § I 6—(rm(a,&))' |. 


Remark 1. The particular list (I) — (V) of schemata for generating 
the primitive recursive functions (Basis A) is a handy one. If constants 
be allowed as primitive recursive functions of 0 variables, a basis is 
obtained by changing (II) to 

(H B ) 9 = 0, 

allowing n = 0 or m — 0 in (IV), omitting (Va), and allowing « = 1 
in (Vb) (Basis B). This basis emphasizes the fundamental role of 0 and '. 
The constant functions C° q for q > 0 are introduced by successive ap¬ 
plications of (IV) with n = 0, m = 1, S as the <]>, and C®_j as the x; and 
Q for n > 0 by (IV) with m = 0, and C® as the cp. Substantial reductions 
in the basis for generating primitive recursive functions have been given 
by Peter 1934 (see also David Nelson 1947 Part II) and Raphael Rob¬ 
inson 1947. (It shall be understood throughout the chapter, outside of 
the present remark and Remark 1 end § 47 , that we are using Basis A.) 


§ 45 . Predicates, prime factor representation. The following 
notion of relative primitive recursiveness enters naturally into our theory 
for showing functions to be primitive recursive, just as the notion of 
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deducibility entered into our theory for showing formulas to be provable. 

A function 9 is primitive recursive in .., <J/, (briefly Y), if there is a 
finite sequence <p x> ..., <p* of (occurrences of) functions (called a primitive 
recursive derivation of 9 from Y) such that each function of the sequence 
is either one of the functions Y (the assumed functions), or an initial 
function, or an immediate dependent of preceding functions, and the 
last function 9 * is 9 . 

Since this definition has the same form as that of deducibility, to each 
of the general properties of \- (§ 20 ) a corresponding principle can be 
stated now. For example, if 9 is primitive recursive in Y, and some of 
the functions Y are primitive recursive, then 9 is primitive recursive in 
the rest of the functions Y. An example (with / — 1 ) will be given later in 
which "if <p v ..are primitive recursive, then 9 is primitive recursive” 
is true, but "9 is primitive recursive in 4>i». • • * <|h” is false (Example 2 § 55). 

Example 1 . In Example 1 § 44, 9 is primitive recursive in £, rq, 0, 
with X>, i)> 9, U\, 6 j, U\, Cf, 9 as a primitive recursive derivation. 

Our general result on explicit definition (§ 44) can be stated now thus. 

#A. A function 9 definable explicitly from functions Y and constants 
q v ..q s is primitive recursive in Y. 

By £ ty(x v .... x n ,y) we mean the sum of the numbers ^(x v ..x n ,y) 

V<z 

for all natural numbers y such that y < z, if z > 0 ; and 0 , if z = 0 . It 
is a function of x v ..., x n , z for any given function .., x n , y). By 

n <K*1 . x n ,y) we mean similarly the product of the numbers 

V<z 

ty(x v ..., x n , y) for y < z, if z > 0 ; and 1 , if z = 0 . 

#B. The finite sum £ ty{x lt ..., x n , y) and product II ty{x it ..x n ,y) 

y<z y<z 

are primitive recursive in 1 )/. 

Proof. The sum E ty{x lt ...,x n ,y) is given from tj'fo. y) by 

v<z 

the following recursion on z: 

S <!>(*!,..., *„, y) = 0 , 

v<0 

S I l>(x v ..x n ,y) = ..., x n , z) + E (J >{x v ...,x n ,y). 

V<z[ V<z 

Other finite sums and products reduce to these by explicit definition; 
e.g. E ^(y) = 2 <|/(y) = E tj >{y), S t|/(y) = £ ^{y+w'), 

y<z y = 0 y<z' w<y<z y<z — w' 

2 4«(y) = £ ^(y) = £ ty(y+ w ). 

wSv<z y = w v<z'—w 
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Although in this chapter we are developing an intuitive and not a 
formal theory, we shall sometimes wish the conciseness of expression which 
a logical symbolism affords, and in particular we shall have a need for 
such a symbolism in forming notations for predicates and functions. 
For these two purposes, we now introduce a new logical symbolism. This 
symbolism is to be taken as informal and meaningful, in contrast to that of 
the formal system as subject matter for metamathematics. An expression 
in the new intuitive symbolism is to be distinguished from a formula in 
the old formal symbolism by the differences in the symbols excepting 
and by context. This distinction between two symbolisms is introduced 
in this book (and in Godel 1931) for the named purposes, and is not an 
established usage in the literature. (Our intuitive logical symbolism, 
excepting “ = “(Ely)”, and the operators with ”x<y” etc., is the 

formal symbolism of Hilbert and Bernays 1934, 1939; and our formal 
logical symbolism, excepting and " 3 !y”, is that of Gentzen 

1934 - 5 •) 


Symbols in the 

Words in the 

Symbols in the 

intuitive symbolism. 

English language. 

formal symbolism. 

Q=R. 

Q is equivalent to R. 

Q ~ R- 

Q-*R. 

Q implies R (if Q, then R). 

QDR. 

Q&R. 

Q and R. 

Q&R. 

QVR. 

Q or R. 

QVR. 

Q- 

not Q. 

■'Q- 

(y)E(y). 

for ally, R{y). 

VyR(y). 

(Ey)R(y). 

there exists a y 
such that i?(y). 

3yR(y). 

(E\y)R(y). 

there exists a unique y 
such that R{y). 

3 !yR(y). 

{y) v < z R{y)- 

for all y < z, R[y). 

Vy(y <z => R(y)). 

(Ey)y<My)- 

there exists a y < z 
such that R(y). 

3 y(y<z &R(y)). 

w„<zR(y)‘ 

the least y < z such 
that R[y), if (Ey) y<z R{y ); 
otherwise, z. 



Similar notations are formed by using “(y)”, “(Ey)” and “py” with the 
inequalities “y<z”, “w<y<z”, “w<y<,z”, "u><,y<z’’,“w<y<z”. When 
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the indicated range of y is empty, the “(y)” expression is true, and the 
"( Ey )” expression is false. When the indicated range contains no y such 
that R(y), the value of the “fry” expression is the cardinal number of the 
range. 

In the present theory we often talk about the truth values ‘true’ 
(briefly t) and ‘false’ (briefly f) of propositions, instead of the propositions 
themselves. (Context will distinguish this use of “t” and "f” to stand for 
the truth values of propositions from the analogous use of them in §§ 28 , 
36 in a valuation procedure applying to formulas.) When we do this, we 
have immediately four types of functions, (a) Functions from { 0 , 1 , 2 ,...} 
to {0, 1,2,...}, called number-theoretic functions or briefly here functions. 
(b) Functions from { 0 , 1,2 ,...} to {t, f}, called number-theoretic predicates 
or briefly here predicates, (c) Functions from {t, f} to {t, f}, called truth- 
value functions or propositional connectives. We use five of them =, 

&, V, — , defined by the same tables as were given in § 28 for the respective 
formal operators 3 , &, V, -i. (d) Functions from {t, f} to ( 0 , 1, 2 , ...}. 
The function of this type which correlates 0 to t and 1 to f enters into the 
definition of ‘representing function’ given below. 

Of course, when the propositions are not being identified with their 
truth values, ‘predicate’ means propositional function of natural numbers 
(§ 31 ). Our practice of at times talking about the truth values t, f instead 
of the propositions calls for comment. In fact it is immaterial in many 
contexts whether we think of the values of the predicates as propositions 
or truth values t, f. This is because the essential mathematical meaning of 
the propositions comes from the definition of the predicates which take 
them as values. For example, consider the two propositions 3 <5 and 
3 < 5 . They are distinct propositions, differing in meaning. At first sight 
something appears to be lost if we identify them both with the one 
object t. However, if we identify the proposition 3 <5 with t, at the same 
time stating that this is the value of the predicate < for 3 and 5 as ar¬ 
guments, we express all the meaning of the original proposition. In other 
words, the proposition 3 <5 is synonymous with the proposition that 
the predicate <, interpreted as having its values in the domain {t, f}, 
takes the value t for 3 and 5 as respective arguments. (Moreover, here it is 
immaterial whether “t” and “f” mean ‘true’ and ‘false’ as we provide 
above,' or are simply any two distinct objects as in §§ 28 , 36 . The predicates 
under the two interpretations are isomorphic, so the abstract mathe¬ 
matical content of the proposition that the value of 3 <5 is t is the same.) 

In working closely with functions, we need to be aware of two meanings 
of the common functional notation, as noted in § 10 . For predicates. 
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there are three possible meanings (or six if we distinguish between propo¬ 
sitions and truth values). 

Meanings of “P(x v ..x n )”. Alternative notations. 

1. The predicate P{x lt .x n ). P, or . .x n P(x v ..., x n ). 

2 . The value of the predicate P No alternative notation, 
for x v ..x„ as arguments 

(the ambiguous value). 

3 . The proposition that P(x v ... ,x n ) (x x ).. .(x n )P(x v ..x n ). 
is true for all x v ..x n . 

These three senses correspond, respectively, to the name form (§ 31 ), 
conditional (§ 32 ), and generality (§ 32 ), interpretation of the free varia¬ 
bles Xj.x n in a formula P(x x ,..., x n ) of the formal system. 

Example 2 . The two statements 
f E( 0) (or_E( 0 ) = t), 

1 £(«') ^ E(a), 

define the predicate E(a) (= {a is even}) by recursion. — We can define 
a function e(a) by the primitive recursion 

I e(0) = 0, 

l s(a') = sg(c(a)) 

(cf. #9 § 44 ). Then E(a) = t(a)= 0 . 

We say that a function <p(x v . ..,*„) is the representing function of a 
predicate P(x v ..., x„), if cp takes only 0 and 1 as values and satisfies the 
equivalence 

P(x v ...,x„) = cp(x v .. .,x„)= 0 ; 

or in other words, when the values of P are given as t and f, if cp(* x ,..., x„) 
is 0 when P(x v ..., x„) is t, and cp^,..., x n ) is 1 when P(x v ..., x n ) is f. 

We say that a predicate P(x v ...,x n ) is primitive recursive, if its 
representing function <p(x lt ..., x n ) is primitive recursive (e.g. E(a) in 
Example 2 ). This definition follows Godel 1931. 

As another example, we list the equality predicate, the representing 
function being shown at the right (cf. ##10, 11). 

# 14 . a=b. sg | a — b |. 

We furthermore say that a function <p or predicate P is primitive re¬ 
cursive in predicates and functions Y, if the corresponding statement 
holds replacing the predicates among P, Y by their representing functions. 
Godel 1931 gave some theorems concerning primitive recursive func- 
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tions and predicates, which we state as follows (##C—E). The facts 
had also been obtained by Skolem 1923. 

#C. A predicate P obtained by substituting functions Xi» • • •» Xm for 
the respective variables of a predicate Q is primitive recursive in Xi» • • •» Xm> Q- 

Proof. If the given predicate is Q{y lt .. .,y m ) with the representing 
function • •> y»»)> an( i the functions substituted are Xi( x i>- • •« x n)> 

• • •> Xm(*i>' • •» x n)> the representing function of the new predicate 

Q(Xl( X l> • • • » x n)i • • • > Xm{ x l> • • • > ^n)) is • • •» x n)> • • • > X«*(^l> • • • > x n))‘ 

This function is primitive recursive in ij>, Xi»* • X™ by Schema (IV). We 
know from § 44 that no generality is lost by considering the substitution 
as of this particular form; and similarly in #D: 

#D. The predicate Q{x v ...,#„) is primitive recursive in the predicate Q. 
The predicates Q{x v ...,*„) V R{x v ...,x n ), Q(x v ...,x n )& R(x v ...,x n ), 
Q(x v ...,x n )-+ R(x v ...,x n ) and Q{x v ...,*„) = R(x lt ...,x n ) are primi¬ 
tive recursive in Q and R. 

Proof. Let the representing functions of Q(x v . .x n ) and R{x l> .. 
x n ) be <J/(x 1( .. x n ) and x(*i>. .x n ), respectively. Then the representing 
function of Q(x v ..., x n ) is sg(<K*i,.. x n )) (# 9 ), which is primitive 
recursive in <J/. The representing function of Q(x v ..., x n ) V R(x lt ..., x n ) is 
ty(x lt ..., x n )-y (x 1 ,..., x n ), which is primitive recursive in <]> and x- The 
rest of the theorem follows by known equivalences for = in 

terms of — and V (cf. Chapter VI, allowing for the differences in the sym¬ 
bolism). 

#E. The predicates ( Ey) y<z R(x v ...,x n ,y) and {y) v<z R(x v ...,x n ,y) 
and the function \xy v<z R{x\,..x n ,y) are primitive recursive in the predi¬ 
cate R. 

Proof. Let x(*i> • • • > x n> y) he the representing function of 
R(x v . .., x n , y). Then II y{x v ..x n ,y) is the representing function of 

y <z 

(Ey) y<z R(x v ..., x n , y). This is primitive recursive in x by #B. Simi¬ 
larly, sg(2 x( x v • • •» x n, y)) is the representing function of (y) I/<15 

V<z 

R(x v '. y) (# 10 ). We illustrate the proof for [iy y<z R{x v . y) 

with an example. Let the values of x v ...,x n be fixed, and write 
simply “x(y)” for x(*i> • • • > x »> V) with the fixed values of x v ...,x n . 
Suppose that 2 = 7 , and that for y = 0, 1, ..., 6 (first row below) x(y) 
takes the values shown (second row). 
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y 

x(y) 

*(y) = n x(s) 

«( y ) = s *(0 

£<1/ 


0 1 2 
1 1 1 
1 1 1 

0 1 2 


3 4 5 
0 1 0 
0 0 0 

3 3 3 


6 7 = z 

0 
0 

3 3 


The desired number \xy y<z R{y) is the least y (first row) < z for which 
R(y) is true, i.e. for which a 0 appears in the second row, if there is such a 
y. In our example, there is, and the least is 3 . This number also appears 
as the last number o(z) in the fourth row. The device illustrated will 
evidently work in any case. To change the example, if ( Ey) y<z R(y ), so 
that no 0 occurs in the second row, then a(z) will be z, which is what 
y.y y<z R(y) was defined to be in this case. The function 0(2) written out in 
full is 2 II x( x i>- • • > x n, s). By #B, this is primitive recursive in 7. 

t<Z 8<t 

In using these theorems, we may combine several applications into 
one step. By #14 with #C, , x n )—x(x v ...,x n ) is primitive 

recursive in 4 *. X', e.g. using § 44 , c'-\-a—b is primitive recursive. By 

##E, C and § 44 , (Ey) y<l ^ {Xl . Xn) R(x v ..x„, y) is primitive recursive 

in 4 > R) e.g. using §44 further, the following is primitive recursive. 

# 15 . a<b. a<b = {Ec) c<b [c’+a=b], or sg {a'—b). 

The inequality “y<z” in #E can be changed to “y<z”, "w<y<z”, 
“w<y<,z”, “w<y<z” or “w<Ly<,z "; for example, 

(y)w<y<z R ( X V • • ’ x n> y) = (y) v <z>^wR( X V-> X r» V + w). 

A set of predicates Q it . . Q m is mutually exclusive, if for each set of 
arguments not more than one of them is true (cf. § 3). 

#F. The function <p defined thus 

j ?!(*!,...,%„) if Q 1 (x 1 ,...,x n ), 

~ | ?m(* 1,. • X n ) if Q m (x 1,. ..,x n ), 

[ ¥rn+i(*i>< • •, Xn) Otherwise, 

where Q v .. ., Q m are mutually exclusive predicates {or y{x l: ..., x n ) shall 
have the value given by the first clause which applies) is primitive recursive in 
?!»•••, «Pm+i» <?!.•••, Qm- (Definition by cases.) 


Proof, for Q v ...,Q m mutually exclusive. First method. Let 
4 i,.. 4 m be the representing functions of Q v ..Q m . Then (omitting 
“(x 1 ,..., x n )” to save space) 

9 = S g(W • 9l + • • • + Sg(4m) • 9m + 4l • • • • • 4m * 9m+l- 
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Second method. 

9 = M'*<9i+...+*»+i (Qi & y=9 1 ) v ... V (Q m & y=f m ) V_ 

(<?!&... & Q m &y=<p m+1 ). 
Prime factor representation. Let the prime numbers in order of 
magnitude be p 0 , p x> p 2 ,..., Pi,... (i.e. p 0 =2, p t = 3 , /> 2 = 5 , ...). 
The fundamental theorem of arithmetic (Gauss 1801 ) states that a given 
positive integer a can be factored into a product of prime factors which is 
unique to within the order of the factors. Thus we have a unique rep¬ 
resentation of a of the form 


(1) a = p$>p<?-p?-... >p? ... {a # 0), 

where a, is the number of times />, occurs in a as factor (0 if p t is not a 
factor of a). We can regard the product (1) as extending indefinitely, 
all but a finite number of the exponents being 0. 

We now add to our list of particular primitive recursive functions 
and predicates. 

# 16 . a\b m a divides b. a\b= (Ec) c < b [ac = b], orsgrm(&,a). 

# 17 . Pr(a) ss a is a prime number. Pr(a) = a > 1 & (.Ec) 1<c<0 [c|a]. 

# 18 . Pi — the t'+l-st prime f p 0 = 2, 

number. \ p t , = <*< Pf ! + iPr(*), 

where the upper bound p t \ + 1 for a; is given by Euclid’s demonstration 
that to any p there exists a prime > p and <; p \-{-1 (§ 40 ). The com¬ 
bining of an application of #E with a primitive recursion is legitimate, 
as it merely condenses what could be accomplished by first introducing 
X(c) = y-x c<x < c i n Pr(*), and then writing the second recursion equation 

as Pf = X (Pi)- 

the exponent a f of p^ _ 

# 19 . (a)i = in ( 1 ), if a # 0 ; (a)< = [Oc x<a [p x .|a 8cpf ja]. 

0, if a = 0. 


We may write ((a),), as (a) {>j , (((a),),)* as {a) ii>k , etc. 


lh(0,a) =0, 

lh(* v , a) = { lf Pi ) a> 

\ lh(t,a) otherwise. 

lh(a) = lh(a, a). 


the number of non- [ lh(0, a) = 0, 

#20. lh<«) = vanishing exponents _ f B-M+lI if PM, 

in (1), if a#0; f lh(t,a) otherwise. 

0, ifa = 0. lh(a) = lh(a, a). 

We can represent the finite sequences a 0 ,..., a 8 of positive integers by 
the numbers a = pff -... ‘p a s *\ then lh(a) is the length s+1 of the se¬ 
quence represented by a. 

#21. a*b = a- n p\$ a)+i . 

i<lh{b) 
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Then if a = ptf-...-p? {a 0 ,...,a g > 0 ) and b = p\«- - -p\t 

{b 0 , .. b t > 0), a*b = p“' ■ ... • PT‘p b s+i • • • • • Ps+t+i- For an Y « and 
any such 6, a*l — a, 1 *b = b, 1*1 = 1. 

§ 46. Course-of-values recursion. In proving a theorem T(y) 
by induction, it may happen that the case T(y’) of the theorem depends 
not simply on the immediately preceding case T(y), but on one or more 
preceding cases. This kind of a proof by induction we have called a 
‘course-of-values induction’. It can be reduced to a simple induction, by 
first proving the lemma ( s) s < v T ($) by simple induction, after which the 
theorem follows by setting s — y (cf. *162a § 40). 

The analogous situation arises in definition by induction. The function 
value 9 ( 0 ) is given outright; and the function value (p(y r ) is expressed in 
terms of y and one or more of the preceding values 9 (s) for s <, y. The re¬ 
cursion is then called a course-of-values recursion. We shall see that it can 
be reduced to a primitive recursion by an analogous device (cf. Peter 
1934)- 

The two cases of the definition of 9 may be combined (cf. *162b), by 
saying that y(y) is expressed in terms of y and 9 (s) for s < y. When 
y — 0 , this means that 9 ( 0 ) is given outright, since the set of values 
9 (s) for s <y is then empty. 

More generally, let the function to be defined be 9 (y, x 2> ..., x„) where 
x 2> ..., x n are parameters (remaining fixed throughout the recursion). As 
an auxiliary function, we introduce 

(0 ?(y= n pf itX . x » ) , 

i<y 

called the course-of-values function {in y) for the given function 
9 (y, 

Given the sequence of the values 9 (s, x 2 ,..., x n ) of our original function 
for s < y, by ( 1 ) we obtain the value 9 (y‘, x 2> ..x n ) of the course-of- 
values function. Conversely, given ?(y; x 2> ..., x n ), we can extract all the 
values 9 (s, x 2> ..x n ) for s < y with the help of #19 thus, 

( 2 ) 9 (s, x 2 ,...,x n ) = ($(y; x 2> . . x n )) t if s < y. 

So in a sense the knowledge of the value 9 {y\x 2 ,..., x n ) of the course- 
of-values function is equivalent to the knowledge of the sequence of 
values 9 ( 0 , x 2 , .... x n ), ..., 9 (y— 1 , x 2 , ..., x n ) of the original function. 

#G. If 9 satisfies the equation 

(3) 9 (y, x 2 ,..., x n ) = x(y, ?(y; x 2 ,..., x n ), x 2 ,..., x n ), 
then 9 is primitive recursive in y . 
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Proof. First we set up a primitive recursion for 9 , 

f ?(°; *»•••»*«) = 

’ l <?(y';x 2 ,...,x n ) = ?(y; .*»>-*».*«>. 

Then we obtain 9 from 9 by the explicit definition, 

(5) 9 (y, x 2 ,..., x n ) = (9 (y'; x 2 ,..., x n )) v . 

Example 1 . Let 

(a) 9 (y) = n (y + 9 ( 5 )). 

s<y 

The sequence of the values of this function, and of its course-of-values 
function, are as shown. 

y 0 12 3 4 ... 

9 (y) 1 2 12 300 145920 

9 (y) 1 2 1 2 1 -3 2 2 x -3 2 -5 12 2 1 -3 2 -5 12 -7 800 ... 

Note that the last exponent in 9 (y') is always the value of 9 (y); e.g. 
( 9 ( 3)) 2 = 12 = 9 ( 2 ). To apply #G, note that by (2) 

(b) 9(y) = n (y + (9(y)) s ). 

8<V 

This is of the form (3), and by ##1, 19, B and G, 9 is primitive re¬ 
cursive. 

This version of #G accomplishes the reduction of course-of-values 
recursion to primitive recursion, for cases when the course-of-values re¬ 
cursion is already given in the form (3) of a dependence of 9 (y, x 2 ,..., x n ) 
on the number 9 (y; x 2 ,..x n ) besides on y, x 2 , ..., x n . 

We illustrate further how to reduce course-of-values recursions not 
already so given to the form (3). 

Example 2. recursion from a double basis. 

9(0) = <h> 

(a) J 9(0 = <lv 

. 9(y") = x(y> 9 (y)> <?(y'))- 

First we restate this in the more compact form for course-of-values 
recursion (using ## 6 , F), thus 

q 0 if y = 0 , 

(b) 9 (y) = | if y = 1 , 

x(y- 2 , 9 (y— 2 ), 9 (y — 1 )) otherwise. 

Then we express 9 (y—2), 9 (y—1) as (9 {y)) y + 2 , {9(y)) y ^ v respectively. 

The method applies also to definitions of predicates by course-of-values 
recursion. 
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Example 3 . Consider the equivalence 

(a) T(y) = y= 23 V V{y) V [y= 2 17 • 3 '">‘ • 5 ( ^ & T((y) 1 ) & T((y) 2 )] V 

[y = 2 19 - 3 (w)l • 5 (y)2 & T^) & T((y) 2 )] V [y= 2 “- 3 ( * ,)l & ^(y^)], 

where F is a given predicate. This defines T(y) by course-of- values in¬ 
duction on y. For when y=0, all disjunctive members on the right are 
false except perhaps the second; so T( 0 ) = F(0). When y > 0, (y)! < y 
and (y) 2 < y. 

Thus T(y) is expressed in terms of y, V and T(s) for s < y only. Let 
-r(y) be the representing function of T(y) ; and let T{(y)^j, T{(y) 2 ) in the 
right member of (a) be expressed as (r(y)) (v)l =0, (v (y)) (l/)2 =0, re¬ 
spectively. Now the definition of T has the form 

(b) T(y) ^ R(y, r(y)) 

where, by # # 2 , 3 , 14 , 19 , A, C and D, R(y, z) is primitive recursive in V. 
This means simply that the representing function p of R is primitive 
recursive in the representing function o of V; so we have an equation of 
the form 

(c) x(y) = p(y, x(y)) 

where p is primitive recursive in o. By #G, t is primitive recursive in p, 
and hence in u; i.e. T is primitive recursive in V (and is primitive recur¬ 
sive if V is). 

In these examples we do always succeed in reducing the given course-of- 
values recursion to the form ( 3 ) by use of ( 2 ). Closer examination in § 47 
will show why, and enable us to formulate a version of #G which in¬ 
cludes that reduction. 

Example 4. simultaneous recursion. The function values <p x (y) 
and <p 2 (y) are expressed in terms of y and values <p 1 (s) and cp 2 (s) for 
s < y. Reduce to #G by using as auxiliary function 

<p(y) = 2'P l(v) 3 < P i(v) . 


*§ 47 . Uniformity. In ##A — G (considering for the moment only 
functions and not predicates), our concern was not primarily with any 
particular functions 9 and Y, but with methods by which a function 9 is 
defined from 'F as unspecified functions. In showing for a particular 
such method that a function 9 of n variables is primitive recursive in ¥*, 
our applications of the schemata (I) — (V) did not depend on what 
functions ¥ are, so long as their number l and the respective numbers 
tn v ..., m l of arguments which they take is fixed. In other words, we 
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gave a primitive recursive derivation schema of 9 from Y, with a fixed 
analysis. The definition of ‘analysis’ is analogous to that given in § 20. 
For an application of Schema (II) it includes the specification of the n 
and q ; for (III) of the » and i ; etc. (Analogously we often obtained derived 
rules for the formal system by exhibiting a ‘deduction schema’ with 
metamathematical letters standing for unspecified formulas, variables, 
etc.) Under these circumstances, we say that 9 is primitive recursive 
uniformly in Y. 

We can also explain this uniformity notion as follows. For a particular 
method of defining a number-theoretic function 9 from number-theoretic 
functions Y, we can write 9 = F(Y) to express the fact that what 
function 9 shall be is determined by what functions Y are. Then F is a 
fixed mathematical function of higher type, namely one from l number- 
theoretic functions Y of tn v ..., m, variables respectively to a number- 
theoretic function 9 of n variables. We call such a function F a scheme 
function or schema or scheme or functional. We can also write y{x v . ..,*„) 
= F(Y; x v ..., x n ) (with the same F) to express the fact that (by the schema 
F) what natural number y(x v ..., x n ) shall be is determined by what 
functions Y are and what numbers x v ..., x n are. 

For any fixed n and m, Schema (IV) constitutes a functional, which 
we have already denoted by S”. For fixed n (and when n — 1, fixed q) 
Schema (V) constitutes a functional R* or R n . The other three schemata 
(I) — (III) define particular number-theoretic functions S, C”, U'f (or 
constitute functionals with 1 = 0 ). 

Now we say that a functional (or schema) 9 = F(Y) is primitive 
recursive, or that 9 is primitive recursive uniformly in Y, if F is definable 
explicitly from the functionals S”, R*, R” and constants S, C”, U”. 

Example 1 . In Example 1 § 44, 9 is primitive recursive uniformly 
in C, v), 6. Under the first version of the definition of uniformity, we see 
this from the fact that the analysis of the primitive recursive derivation 
schema C 7), 0, U\, 0 1( U\, C\, 9 (consisting of the explanations opposite 

1 — 9 at the right) is fixed. Under the second version, we see it from the 
fact that (b) expresses 9 — F(£, tj, 0 ) explicitly in terms of S3, S|, Sj and 

U\, u\, Cl 

Sometimes a method for determining a function 9 from functions Y 
is specified only under some restriction on the Y’s. To establish uniform 
primitive recursiveness, we show then that there is a fixed succession of 
applications of Schemata (I) — (V) which leads from Y to the same func¬ 
tion 9 as the given method, for any Y to which the given method applies 
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(or is intended to apply). The succession of applications of Schemata 
(I) — (V) then leads in fact to some function 9 from any Y, since each 
of (I) — (V) has this property. Thus: The given method constitutes a 
functional 9 = F(Y) defined only for a restricted range of Y. The suc¬ 
cession of applications of (I) — (V) constitutes a completely defined 
functional 9 = F^Y), which is primitive recursive and such that F X (Y) 
= F(Y) on the range of definition of F. 


Example 2 . Let a function 9 be defined thus, 


(a) 


?(#) = 


9i( x ) ^ t h(*)=°> 
(f> 2 (x) if (J/ 2 (*)=0, 
9 3 (x) otherwise, 


where <p v <p 2 , 93, ^ 1( <J / 2 are given functions such that, for each x, fy x {x) and 
4*2 (x) are each either 0 or 1 and not both 0 . Then we can write 
(b) 9(A) = sg^*))-?^*) + sg(<J>,(*))-9,(*) + 


and conclude that 9 is primitive recursive uniformly in 9^ 9 2 , 93, ^1, ^2 
(cf. the first proof of #F § 45 ). The 9 of (a) was only defined for ij/j, 
satisfying the restriction stated; but (b) defines a 9 without any restric¬ 
tion, which is the same 9 as the former when the restriction is satisfied. 


For schemes involving predicates, we say that a function 9 or predicate 
P is primitive recursive uniformly in predicates and functions Y, if the 
corresponding statement holds replacing the predicates among P, Y by 
their representing functions. The interpretation just explained applies 
when the functions introduced as representing functions of the predicates 
among Y are then treated as unrestricted function variables in applying 
the schemata (I) — (V). 

Using the explained interpretation, we can say that a function 9 (or 
predicate P) is primitive recursive uniformly in Y, even when some of the 
Y's are particular functions (or predicates). Then if any of those Y’s 
which are particular are primitive recursive, 9 (or P) is primitive re¬ 
cursive uniformly in the rest of the Y’s. 

If 9 is primitive recursive uniformly in 0, Y as function variables, and 
we then take 0 to be 0*, the resulting function 9* is primitive recursive 
uniformly in 0 *, Y (and hence if 0 * is primitive recursive, in Y). This 
principle works whether 0* is a particular function, or a function variable, 
including the case that it depends on additional number variables 
c v ...,c v as parameters. The principle is stated accurately as Lemma I. 
To make it clear how many (and which) independent variables the 
functions have, we write 0 = Xs l .. ..., s g ) (a function of q varia- 
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bles), 6* = Xs : . .. s v Cy .. .c p 0*(s 1( ..., s v , c v .... c„) (a function of q+p 
variables), and Xs x .. .s, 0*(s 1 ,.. s Q , c v .. c„) for the function of q 
variables s lt . .s„ which we get from 6* whenever c v ... ,c p are fixed 
numbers. 

Lemma I. Given a functional cp = F(0, Y) as follows 

9 (^ 1 * • • •» X n ) ~ ^(^1 • ■ • $q 0($i • • • Sq) > ^ > Xy, • • • > X n ) > 

let a functional 9 * — G(0*, Y) be defined as follows 

9 *(*i>- c v ..., c P ) = 

G(Xs^ . • • SflCi. . . Cp 0*(s^,. . ., Sq, Cy,. . ., C jfj, Y*, X . • •, X n , Cy, • • • 1 Cp) === 

F(XSj,, .Sq 0*(si,. .., Sq, Cy, ..., c p), Y*, x , x n ). 

If F is primitive recursive, so is G. 

Proof. The essence of the proof is that both explicit definitions and 
primitive recursions remain such when parameters are introduced. 

To give the proof in more detail, we use course-of-values induction on 
the length k of a primitive recursive derivation schema <p 1; ..., 9 * of 
9 from 0, Y. Seven cases arise according as 9 (= cp k ) is 0, or one of the Y’s 
(say <];<)» or an initial function by Schema (I), (II) or (III), or an im¬ 
mediate dependent of preceding functions by Schema (IV) or (V). 

Case 6 : <p(x v . ..,x n ) = <p(xi(xy,.. .,x n ),..., Xm{xy,. • •. *»)), where 
Xi>- • •» Xm precede cp (= cp k ) in 9 ^..., 9 *. Then 

9*(*1> • • • > X n , Cy, ..., c P ) = 

4*(Xl ( Xy, • • • > X„, Cy, . . ., Cp), . . ., X m {Xy ,. . ., X n , Cy, . . . , Cp), Cy, . . . , C„). 

By the hypothesis of the induction, tjj*, y*,. • - >Xm are primitive re¬ 
cursive uniformly in 0*, Y. By #A, 9 * is primitive recursive uniformly 
in ^*, Xi»- • •. Xm'< and hence, in 0*, Y. 

Example 3. Not every number-theoretic function is primitive re¬ 
cursive. (Why? Cf. §§ 1 , 2 : Is every real number algebraic?) Let 5(c) be a 
particular function which is not primitive recursive. Let 9 be defined 
from an unspecified function 0 thus, 

90 = 5(8(0)). 

Then, for each particular 0, the resulting cp is a constant function, and so 
is primitive recursive, by an application of Schema (II) with n = 1, 
q = 5(0(0))'. A fortiori, for each 0, 9 is primitive recursive in 0, with C* 
for q = 5(0(0)) as a primitive recursive derivation of 9 from 0. But, 
because the analysis of this derivation depends on 0 , we cannot conclude 
that 9 is primitive recursive uniformly in 0 . Indeed if it were, by Lemma I 
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taking U\(s, c ) (which is primitive recursive) as the 0 *(s, c), the resulting 
function <p*(x, c) would be primitive recursive, and hence so would be 
<p*( 0 , c). But cp*(0, c) = 1 {U\ ( 0 , c )) = — Thus also Lemma I does not 

hold, if the hypothesis that F is primitive recursive, i.e. that 9 is primitive 
recursive uniformly in 0 , Y, is weakened to: 9 is primitive recursive in 
0 , Y, for each 0, Y. — In this example, of course cp is primitive recursive 
uniformly in £, 0. 

Since the former proofs actually establish uniformity: 

—G (second versions). Reread the original versions with ‘‘prim¬ 
itive recursive uniformly” in place of ‘‘primitive recursive”. 

Usually a course-of-values recursion arises in the following form. The 
ambiguous value cp(y, x 2 ,..., x n ) is given in terms of y, x 2 ,..., x n , other 
functions and predicates Y, and <p(s, x 2 , ..., x n ) as a function of s for the 
given x 2 > ..x n . The expression by which it is given is the result of sub¬ 
stituting cp (s, x 2 ,..., x n ) for a function variable 0(s) of a primitive re¬ 
cursive functional. This functional has the property that its value is 
not changed, if values of 0 (s) are changed for s > y only. In other words, 
there is a primitive recursive functional F(As 0 (s), Y ;y, x 2 ,..x n ) such that 
(6) cp(y, * 2 ,. ..,x n ) = F(Ascp(s, x 2> . Y; y, x 2 ,..., x n ), 

F ( Xs M s ). Y; y, = F(Xs 0 2 (s), Y; y, * 2 ,..., x„) 

whenever 0j(s) = 0 2 (s) for all s < y. 

Under these circumstances, we say that cp(y, x 2 ,..., x n ) is primitive re¬ 
cursive uniformly in cp(s, x 2 ,..., x n ) for s < y and Y. 

Like terminology is used for predicates (reading “P”, “H”, “=" in 
place of “9”, “0”, “ = 

In case we are considering the definition of 9 from Y only for a restricted 
range of Y, then ( 7 ) as well as (6) need only hold on this range. 

Example 4 . Let 9(y, x) be defined by 

(a) 9(y, x) = y • p( 9 (ff(y). *)) + x ) I y] 

where p, a are given functions such that a(y) < y for y > 0 . To see that 
9(y, x) is primitive recursive uniformly in 9(s, x) for s < y and p, cr, let 
us insert an unspecified function Xs0(s) in place of Xs9(s, x) in the right 
member of (a), for convenience calling the resulting function '/i(y, x ): 

(b) Xi(y- x) = y • p( 0 (o(y))) + \iz z<y [Q{z) \ y]. 

By # #A, C, E, 16 (using the second versions of A, C and E), y_ x {y, x) is 
primitive recursive uniformly in 0, p, a; and changing values of 0(s) for 
s > y only will not change the value of y A {y, x) under the restriction on a. 
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Example 5 . We can see directly from (a) of Example 1 § 46 ((b) 
of Example 2 § 46 ) that <p(y) is primitive recursive uniformly in ep(s) 
for s < y (in <p(s) for s < y and y). Instead of first writing “ 6 ” in place of 
“9”, we need merely to examine how the right member is constructed 
out of <p(s) regarded momentarily as an unspecified function. Similarly 
from (a) of Example 3 § 46 , T(y) is primitive recursive uniformly in T(s) 
for s < y and V. 

#G (third version). If 9 (y, x 2 ,..., x n ) is primitive recursive uniformly 
in 9(s, x 2 ,. .x n ) for s < y and Y, then 9 is primitive recursive uniformly 
in Y. Similarly for a predicate (reading "P” in place of “9”). 

Proof, for a function 9. By (6), ( 7 ) and ( 2 ), 

,g, 9(y, * 2 ,= F(Xs (y(y;x 2 ,..., x n ))„ Y; y, x 2 > ..x„) 

= x(y>?(y>x 2 ,...,x n ),x 2 ,. ...x„) 

where 

( 9 ) X(y> c - x 2 >• • •. *») = F(^s (c) s , Y; y, x 2 ,..., x n ). 

By Lemma I, y is primitive recursive uniformly in Xsc (c)„ Y; and hence by 
# 19 , in Y. Now the second version of #G applies. 

The result for a predicate P follows by going over from P to its rep¬ 
resenting function. 

Remark 1. Cf. Remark 1 end § 44 . If 9, Y are functions of n, m v .. 
W; > 0 variables, then 9 is primitive recursive uniformly in Y under Basis 
B, if and only if under Basis A. For any primitive recursive derivation 
schema of 9 from Y under Basis A can be transformed into one under 
Basis B as above by supplying a description under Basis B of C” (of C®) 
for each application of (II) (of (Va)). Conversely, given a primitive re¬ 
cursive derivation schema <p v ..., 9* of 9 from Y under Basis B, one 
under Basis A can be obtained by the following process. Say that n = l = 
m x = 1, i.e. that the derivation is of y(x) from #(y). Let a parameter c be 
introduced into each of the functions <p v ..., 9*. Similarly to the proof of 
Lemma I, we can then obtain a primitive recursive derivation schema of 
9(x, c) from <\>(y, c) under Basis A. To this we prefix #(y, c) = p(U\(y, c )) 
and suffix 9(x) = 9 (U\(x), Cj(x)). — For example, if 9(0) = ^( 0 ), 9(y') = 
X(y, 9(y)), then 9 is primitive recursive uniformly in #x under Basis B 
(using successively (II B ), (IV) with n — 0 , (Vb) with n = 1). Hence it is 
also under Basis A. 

§ 48. Godel’s (3-function. The second problem of this chapter 
is to show that every primitive recursive predicate is numeralwise 
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expressible in the formal system of Chapter IV, even though that 
system has function symbols only for the three functions +» '• 
We shall prove this in the next section, following a method of Godel 
(1931, 1934). 

This proof is not essential to our program of formalizing number theory. 
If it did not succeed, we could have arranged instead that recursion 
equations for other functions besides + and • should be axioms of the 
system. Indeed, by an enumerably infinite system of particular number- 
theoretic axioms, we could include recursion equations for all the primi¬ 
tive recursive functions. However it is of some interest that a finite system 
suffices, the more so that we can get along with the two chief functions + 
and • of traditional arithmetic, when taken with the logical constants 
and the predicate =. 

Godel has called a predicate arithmetical, if it can be expressed ex¬ 
plicitly in terms of constant and variable natural numbers, the functions 
-f- and *, equality =, the operations &, V, — of the propositional cal¬ 
culus, and the quantifiers ( x) and (Ex), combined according to the usual 
syntactical rules. (This uses the adjective ‘arithmetical’ in the narrower 
sense, § 9.) 

The reader may readily give the definition more fully as an inductive 
definition, paralleling the definition of formula for the formal system. 
The arithmetical predicates are precisely those which can be expressed by 
name forms in the formal system under the usual interpretation of the 
symbols. (By comparison with the formal treatment in §§39 and 41, 
a<b and rm(c, d)=w are arithmetical.) 

But, using the intuitive symbolism, we shall keep the discussion 
informal for the present. For the application to primitive recursive pred¬ 
icates, we shall require only constructive use of the quantifiers. 

In the next section, we shall need a method of dealing with finite se¬ 
quences a 0 ,...,a n of natural numbers arithmetically; there we cannot use 
the functions a b , p t and (a), of §§ 44, 45 with which we handled finite 
sequences primitive recursively in §§ 46, 47. 

We know that the predicate rm(c, d)=w, where rm(c, d) is the remain¬ 
der when c is divided by d, is arithmetical. 

A set of positive integers d Q ,...,d n are said to be relatively prime, if 
no two of them have a common positive integral factor except 1. For 
example, 3, 4, 5 are relatively prime. 

Consider the n -\-\-tuples of the values of the function rm(c, d), for a 
fixed n 1 -tuple of relatively prime divisors d 0 ,..., d n , as c increases. 
For example (with n = 1), if d 0 = 3, d x = 4, they are as follows. 
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c 0 1 2 3 4 5 6 7 8 9 10 11 12 13 ... 

rm(c, 3) 0120120120 1 2 0 1 ... 

rm(c, 4) 012301230 1 2 3 0 1... 

We see that, as c ranges from 0 to 11, the pair of remainders rm(c, 3), 
rm(c, 4) assumes each of the 12 possible ordered pairs of numbers a 0 , 
a x for a 0 < 3, a x < 4. 

To establish this in general, let rm(c, d 0 ), rm(c, d x ), .... rm(c, d n ) take 
the respective values a 0 , a x ,...,a n for c = j and also later for c = j+k. 
Since j and j+k give the same remainder a { on dividing by d { (i = 0 , 
..., »), their difference k must contain d { exactly; say that k = b^. Thus 
k — b 0 d 0 — b x d — ... — b n dft. 

Here k has each of d 0 , d x ,..., d n as factor. Since by hypothesis d 0 , ..., d n 
are relatively prime, by the fundamental theorem of arithmetic (§ 45) 
k must be a multiple of their product d 0 - d x - ... • d n . 

Therefore the ordered n + 1 -tuple rm(c, d 0 ), rm(c, d x ), .... rm (c,d n ) 
cannot return to a given sequence of numbers a 0 , a v ..a n after less 
than d 0 ‘ d x - ... -d n consecutive values of c. But there are exactly d 0 • d x • 
... • d n distinct sequences of numbers a 0 , a x , ..., a n for a 0 < d 0 , a x < d v 
..., a n < d n . Each sequence is therefore taken once in any d 0 - d x - ... • d n 
consecutive values of c. 

Following Godel 1934 , we use this fact to construct a function (3(c, d, i) 
with the two properties, 

( 1 ) the predicate | 3 (c, d, i) — w is arithmetical, and 

( 2 ) for any finite sequence of natural numbers a 0> a x ,..., a n , there can be 
found a pair of natural numbers c, d such that 

£(c, d, i) = a t (t — 0 , 1 

As we know, a number c (c, < d 0 • d x •... • d n ) can be chosen so that 
rm(c, d ( ) = a,- for i = 0 , 1 , .... n, provided d 0 , d x ,..., d n is a set of 
numbers such that (a) d 0 , d x , ..., d n are relatively prime, and (b) a 0 < d 0 , 
a x < d x , ...,«„ < d n . Our problem will be solved, if we can obtain the 
numbers d 0 , d x , ..., d n as the values of a function $(d, i) for i — 0 , 1 , ..., « 
and a suitable number d, so that 

(i) P(c, d, i) = rm(c, 8 {d, i)) 
also- satisfies ( 1 ). 

Now (1)' will be satisfied if we take 

(ii) B(d,i) = l+(t + l)^. 

For rm {c,d) = w is arithmetical, and 8 {d,i), which we substitute for 
d to get p(c, d, i)—w, is defined explicitly from 1 , + and •. 
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For the given sequence of numbers a 0 , a v ..., a„, let s be the greatest 
of n, a Q , a v .. ., a„, and take d = s!. 

Then, (a) the numbers d- — S(d, i) for i — 0 , 1 , ..n are relatively 
prime. For if two of them l-f(/-j-l)s! and 1 + (/+£+1 )s! had a 
factor other than 1 in common, they would have a prime factor p in 
common, and this factor p would divide the difference which is &-s!. 
But p cannot divide s!, since then it would divide (/-j~l)s!, which is 
impossible since it divides 1 -f- (/-f- 1 )s!. Then also p cannot divide k, 
since k < n < s and every number < s divides s!. Hence p cannot divide 
k-s '.; so by reductio ad absurdum (a) is proved. 

Moreover, (b) for each i (i — 0, 1, ., n), a t < s < s! < l + (f+l)s! 
= 8 (d, i) = d t . 

Julia Robinson 1949* shows that the predicate | (if 16 ) and function ' can 
be used in place of the two functions + and • in defining the arithmetical 
predicates; and Church and Quine 1952 show that a suitably chosen sym¬ 
metric 2-place predicate can be used instead. 

§ 49 . Primitive recursive functions and the number-theoretic 
formalism. Theorem I. If y(x lt ..., x n ) is a primitive recursive func¬ 
tion, then the predicate <?{x v ..., x n ) -=w is arithmetical. (Godel 1931.) 

Proof, by course-of-values induction on the length k of a given 
primitive recursive description . .., <p k of 9 (cf. § 43 ). The cases (I) — (V) 
correspond to the five schemata by which o k , i.e. 9, may occur in the 
description. (For a proof with a similar case structure, cf. that of Theorem 
1 § 21 .) 

Case (I): 9(A) = x’. Then cp(x) — ze w—x-\-\, and w=x -\-1 is arith¬ 
metical. 

Case (II): 9^, ..., x n ) = q. Then 9(34, .... x n ) — w = w=q. 

Case (IV): cp(x v ...,x n ) = J(-/i(*i, ..., x„), .... x m (x v .... x n )), 
where by the hypothesis of the induction, ifvj, . .., — 

y A {x v . . .,x n )=y v .... x m {x v ■ ...x n )=y m are arithmetical. 

Then y{x lt . ,.,x n )=w = (EyP) . . . (.Ey m )[y A {x lt . ..,x„)=y 1 & ... & 
lm{x 1, .. x n )=y m & 4»(y lf . ..,y m )=w]. 

Case (Vb): 9(0, x 2 , ..., x„) = 'J(.r 2 , ..., x n ), 9(y\ a 2 , ..., x„) = 

7,{y. ?(y, X 2 , ...,x n ),x 2 , ..., x n ), where p{x 2 -- x n )=w and y[y, z,x 2 ,..., 

x n ) ~w are arithmetical. Suppose that y,x 2 , . . . ,x n , w are numbers such that 
?(>'» x 2 , ..., x n )=w is true. Then there is a finite sequence of numbers 
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« 0 . ® 1 * • • * > 

(the values of y(i, # 2 ,..., x „) for i = 0 , 1 , ..y) such that 

«0 = • •> *«). 

^1 == «0. ^2> • • • i x n)t 

a 2 = X(1 > a l> x 2 > • • • ’ X n)> 

“ x(y ^ > dy—1 ’ X 2> • • • i X n ) > 

W = fl„. 

But then there are numbers c, d for Godel’s (3-function such that (3(c, d, i) 
— a-i {i — 0, 1, .. y), and the facts (A) can be expressed using the 
£}(c, i i, i)’s instead of the a/s, thus: 

(Ec)(Ed){fi(c, d, 0) = <p(x 2 ,...,x n ) 

(B) & (*)[*■<y p(c, d, t+1) = x(b P(c, d, i), x 2 , ..., x n )] 

&w = (3(c, d, y )}. 

Conversely, if (B) is true, then for any c and i given by (B), the numbers 
(3(c, d, i) for i — 0, 1, ...,y do constitute a sequence a 0 , a 1( .... a y 
satisfying (A); and (A) implies that 9 (y, x 2 , ..., x n )=w. Thus 
<p(y, % 2 > .. •, x n)= u ' is equivalent to (B). But (B) is an arithmetical 
predicate of y,x 2 , .... x n , w, as we see by rewriting it in the form 

(Ec)(Ed){(Eu)[${c, d, 0 )=u & ^(x 2> .. .,x n )=u] & (*)[>'<y-> 

(C) {Eu)(Ev)[$(c,d,i+l)=u & ${c,d,i)=v & y&i,v,x 2 , ...,*»)=«]] 

& p(e, d, y)—w), 

and taking into account the hypothesis of the induction and the arithmet¬ 
ical character of (3(c, d, i)—w and i<y. 

The analysis of primitive recursion in terms of finite sequences of 
natural numbers used here is a number-theoretic adaptation of Dede¬ 
kind’s analysis of primitive recursion (1888). 

Corollary. Every -primitive recursive predicate P(x v ..., x n ) is 
arithmetical. 

For P{x v ...,x n ) = f(x v ..., *„)—0, where 9 is the representing 
function of P (§45). Conversely, the theorem follows from its corollary 
using # # 14, C. The theorem however has the form necessary to the proof 
by induction on the length of the description 9 1( ..., 9*.. 

By translating from the intuitive arithmetical symbolism into the 
formal symbolism, we obtain a formula P(x 1( ..., x„, w) which expresses 
<p(x v ..., x n ) — w under the interpretation of the formal system. By this 
means we shall now prove the following theorem. 
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Theorem 27. Every primitive recursive function <p(x lt .... x n ) is 
numeralwise representable (§41) in the formal system of Chapter IV; i.e. 
there is a formula P(x 1( . .x„, w), containing no variables free other than 
the distinct variables x x ,..., x„, w, such that, for each n-tuple of natural 
numbers x v .... x n , 

(v) if <p(x 1 , ..., x„)=w, then \- P^, . .., x n , w), and 

(vi) f- 3!wP(x 1 , .. x n , w). 

Proof. The construction of P(x x ,. . ., x„, w), and the proof of the 
theorem, are by a course-of-values induction on k, with cases (I) — (V) 
corresponding to those in the proof of Theorem I. 

Case (Vb). By the hypothesis of the induction, there are formulas 
Q(x 2 , .... x n , w) and R(y, z, x 2 ,. .., x„, w) which numeralwise represent 
the respective functions ty(x 2 , . .., x n ) and yjy, z, x 2> . . ., x n ), i.e. they have 
the properties for these functions corresponding to (v) and (vi) for 

<?( x v • • • i x n)- 

According to *(180) §41, Godel’s ^-function p(c, d, i) [=rm(c,l + 
(t+ l)i = rm(c, (*'• <f)')] is numeralwise represented by a formula 
B(c, d, i, w) having a further property *180a. 

The formula 

3c3d{3u[B(c, d, 0, u) & Q(x 2 , . . ., x n , u)] & Vi[i<y D 3u3v[B(c, d, i', u) 
& B(c, d, i, v) & R(i, v, x 2 , ..., x„, u)]] & B(c, d, y, w)} 

shall be the formula P(y, x 2 ,..., x„, w) to represent 9 (y, x 2 ,..x n ). We 
must show that it has the properties (v) and (vi). 

To establish (v), let y, x 2 ,..., x„, w be numbers such that 9 (y, x 2 ,..., x „) 
= w. Then there are numbers a 0 , a lt ...,a y as in the proof of Theorem I, 
and also numbers c and d for these a 0 , a v .,a y such that (3(c, d, i) = a t 
(i — 0 , 1 y). By the property (v) for B, Q and R, the following state¬ 
ments hold: 


1- B(c, d, 0, a 0 ), 

h Q(x 2 , ...,x n , a 0 ), 

h B(c, d, 1, dj), 

h R(0, a 0 , x 2 , ..x 

h B(c, d,y, a v ), 
h B(c, d,y, w). 

1- R(y—1, a u _ v x 2 , . 


We can thence show that h P(y, x 2 ,..., x n , w) by &-introd., 3-introd. 
and *166 § 41. 

To establish (vi), we use an intuitive induction on y. Ind. step. Let 
w = 9 (y, x 2 , .... x n ) and u = 9 (y' f ^ 2 , ..., x„) = jjy, w, x 2 , 
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By (v) and (vi) for R (and for brevity using the informal presentation, 
beginning § 38): (a) R(y, w,x 2 ,.... x n , a), and (b) 3!uR(y, w,x 2 ,..., x n , u). 
By (v) (as already established for P): (c) P(y, x 2 , .... x„, w), 
and (d) P(y', x 2 ,..., x n , u). By the hypothesis of the induction on y : 
(e) 3!wP(y, x 2 , ..., x n , w). We must prove 3!wP(y', x 2> ..x n , w). 
Assume: (f) P(y', x 2 ,..., x„, w). By * 170 with (d), it will suffice to deduce 
a=w with w held constant. For &- and 3-elim. from (f), assume: 
(g) 3u[B(c, d, 0, u) & Q(x 2 , .. .,x n) u)], (h) Vi[i <y' Z> 3u3v[B(c, d, i', u) 
& B(c, d, i, v) & R(i, v, x 2 , ..x n , u)]], and (i) B(c, d, y', w). From (h) 
using *138a §39 (or *167 and *166): 

(j) Vi[i<y D 3u3v[B(c, d, i', u) & B(c, d, i, v) & R(i, v, x 2 , ...,x n ,u)]], 
and (k) 3u3v[B(c, d, y', u) & B(c, d, y, v) & R(y, v, x 2 , .... x n , u)]. For 
&- and 3-elim. from (k), assume: (1) B(c, d, y', u), (m) B(c, d, y, v), and 

(n) R(y, v, x 2 , .... x fl , u). From (g), (j) and (m) by &- and 3-introd.: 

(o) P(y, x 2 , ..., x„, v). From (o), (c) and (e) by *172, v=w, which with 
(n) gives: (p) R(y,u>,x 2 , .. .,x„,u). From (p), (a) and (b) by *172 u=u, 
which with (1) gives: (q) B(c, d,y',u ). From (q), (i), * 180a and *172, u=w, 
as was to be deduced. 

Corollary. Every primitive recursive predicate P{x v .... x n ) is 
numeralwise expressible in the formal system. 

For if P(x 1 , ..., x n , w) numeralwise represents the representing 
function <p of P, then using (vii) § 41 (obtained there by *173, *(164)), 
P(xj, .... x n , 0) numeralwise expresses P. 

Lemma 18b. Theorem 27 and Corollary hold for Raphael Robinson’s 
formal system (§§41, 76) consisting of the predicate calculus with thirteen 
particular number-theoretic axioms as follows: Axioms 14 — 21, and (the 
formulas of) *104 — *107 and *137 {or *136). 

Using Lemma 18a §41. 

Remark 1 . A more ambitious undertaking (relating to the full 
system, not to Robinson’s) would be to establish the provability of 
formulas which express the recursion equations; e.g. for Case (Vb), to 
establish: 

(1) b P(0, x 2 , .... x„, w) ~ Q(x 2 , ..., x„, w). 

(2) b P(y', x 2 , ...,x n , w) ~ 

3z[P(y, x 2 , ..., x„, z) & R(y, z, x 2 , ..., x„, w)]. 

From (1) and (2) by formal induction on y (given by the hypothe¬ 
sis of an intuitive induction on k that b 3!wQ(x 2 ,..., x„, w) and 
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1 - 3!wR(y, z, x 2 , ..x B , w)), then: 

(3) h 3!wP(y,x 2 , ...,x n ,w). 

To establish ( 1 ) and (2), we would begin by formalizing the theory 
of the ( 3 -function, which we gave informally in § 48. It would suffice 
to establish: 

(a) b 3c3dB(c, d, 0 , w). 

((}) b 3c 2 3d 2 {Vi[i<y D 3u[B(c 1( d v i, u) & B(c 2 , d 2 , i, u)]] & 

B(c 2 , d 2 , y', w)}. 

For then the four implications of ( 1 ) and ( 2 ) could all be proved, using 
*180c, (a) (for the second), and ((B) (for the fourth). — We shall not take 
the space to carry out the formalization to establish (a) and ((3). Hilbert 
and Bemays 1934 pp. 401—419 do practically this in another formal 
system, whence (in the manner to be indicated before Example 9 § 74) 
it can be inferred that (a) and (( 3 ) hold in our (classical or intuitionistic) 
system. 



Chapter X 


THE ARITHMETIZATION OF METAMATHEMATICS 

§ 50. Metamathematics as a generalized arithmetic. As we 

remarked in § 42 (following Godel 1931 ), by selecting a particular enumer¬ 
ation of the formal objects, or a particular correlation of distinct natural 
numbers to the distinct formal objects (not using every number), and then 
talking about the correlated numbers instead of the formal objects, 
metamathematics becomes a branch of the arithmetic of the natural 
numbers. In this chapter we shall carry out such an arithmetization of 
metamathematics, using a Godel numbering similar to that of Hilbert and 
Bernays 1939 . 

However instead of carrying out the arithmetization directly, we shall 
first represent the formal system in an intermediate way as a generalized 
arithmetic, and then represent the generalized arithmetic in the ordinary 
arithmetic. This will bring out some analogies which are of heuristic value, 
and the representation of the system as a generalized arithmetic will be 
of interest on its own account. 

The arithmetic of the natural numbers deals with the domain of objects 
which is generated by starting with one primitive object 0 and applying 
one primitive operation ' or +1 (§ 6 ). 

A generalized arithmetic (for the present purpose) is obtained by 
supposing one or more zeros, and one or m )re successor operations. We 
shall adhere to the convention (not the only useful one) that objects 
generated from the primitives in distinct ways are distinct. There are 
several possibilities for representing the formal system as a generalized 
arithmetic. Hermes 1938 considers an arithmetic with the empty ex¬ 
pression as the zero, and the operations of suffixing one of the formal 
symbols as the successor operations. 

The generalized arithmetic which we select has a more complicated 
structure as an arithmetic, but is designed to represent directly the 
grammatical and logical structure of the formal objects. There shall be 
r +1 zeros 0 0 , 0 1 , ..., 0 r , where r is a natural number to be specified later; 
and there shall be one successor operation applying to an s +1 -tuple of 
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arguments, for each of the natural number values of s to be specified 
later. The result of the successor operation applied to x 0 , x x , ..., x, as 
arguments is written “(x 0 , x x ,..., x,)” or also sometimes “x 0 (x 1 ,.... x 3 )’\ 
We call the objects belonging to this generalized arithmetic entities. 

We can express this by an inductive definition (analogous to that given 
for ‘natural number’ in § 6 ). 1 . 0 0 , 0 1 ( .. 0 r are entities. 2 . For each 

admitted s, if x 0 , x x , ..., x, are entities, then (x 0 , x x ,..., x,) is an entity. 
3. The only entities are those given by 1 and 2 . 

As we have already indicated, two entities shall be equal, if and only 
if they are generated from the zeros by the successor operations in the 
same way. To say that x and y are equal we write “x X y” (unequal, 
“x x y”)- We use “X” rather than "=" merely to avoid confusion with 
the = of the formal system. 

Axioms characterizing the domain of entities can be stated, analogous 
to Peano’s for the natural numbers (§§ 6 , 7). In particular, they include 
the principle of proof by mathematical induction in the form corre¬ 
sponding to the mode of generation of the domain of entities (or to the in¬ 
ductive definition just given): If the entities 0 0 , 0 X ,..., 0 r each possess 
a certain property, and if for each admitted s, whenever entities x 0 , 
x x ,..., x s possess the property, the entity (x 0 , x x ,..., x s ) also possesses 
it. then all entities possess the property. The statement of the other 
Peano axioms for the generalized arithmetic is left to the reader. 

The process of generating the entities partially orders them (end § 8 ); 
we write "x-< y” to say that x is generated before y in the process of 
generating y. Expressed inductively: 1 . For each admitted s and each 
i <: s, x, -< (x 0 , x x ,..., x s ). 2. For each admitted s and each * <; s, if 
x -< Xj, then x -< (x 0 , x x ,..., x s ). 3 . x -< y only as required by 1 and 2 . 

We define a function of an entity x and a natural number i, which 
gives the predecessors of a successor entity, thus: 



Xj, if x X (x 0 , x x ,..., xj and i < s, 
x, otherwise. 


We now specify for the rest of this chapter that the number r+l 
of the zeros 0 o , 0 X ,..., 0 r shall be thirteen, and we name them as follows: 

3, &, V, -i, V, 3, +, •, ', 0, a, ,. 

We further specify that s admit the values 0, 1 and 2. This completes the 
definition of our generalized arithmetic as a domain of abstract objects 
which can be recognized and distinguished from one another as individuals 
by the mode of their generation. 

We now have to fix how our formal system (as originally introduced in 
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Chapter IV) is to be represented in the generalized arithmetic. This will 
consist in giving a correlation of entities to the objects of that formal 
system. Those formal objects were explained in § 16 as consisting of formal 
symbols, finite sequences of formal symbols (called ‘formal expressions’), 
and finite sequences of formal expressions. It is not necessary to correlate 
an entity to every formal object, but only to those formal objects which 
are significant for the metamathematics. For example, an asyntactical 
expression such as ((0V00= will have no entity correlated to it. Also 
there will be entities which are correlates of no formal object. 

To the first eleven of the formal symbols listed in § 16 we correlate 
the respective entities 13, &, V, -i, V, 3, =, + , •, ', 0, which we are 
designating now by the same symbols, i.e. we correlate respectively the 
first eleven of the zeros of the generalized arithmetic. 

To the variables a, b, c, d,... of the formal system, we correlate, 
respectively, the entities 

a > ( \> a )» (i> (i> a ))> (i> (i> (i> a )))> • • • 

(sometimes written a, a x , a n , a m , ...), i.e. the twelfth zero, and the 
further entities obtained thence by repeated applications of the suc¬ 
cessor operation of the generalized arithmetic with s = 1 and with the 
thirteenth zero as the first predecessor. 

To terms and formulas such as r+s, r', r=s, A & B, -iA, VxA(x) 
we correlate the entities (+, r, s), (', r), (=, r, s), (&, A, B), (-i,A), 
(V,x,A(x)), respectively, where r, s, A, B, x, A(x) are now to be the entities 
correlated to the given r, s, A, B, x, A(x), i.e. we repeat the correlation 
procedure on the given r, s, A, B, x, A(x). 

Example 1. The entity correlated to the formula 3b(—ib—0) is 

(3, („ a), «),0))). 

But hereafter in designating these entities, except when we wish to 
emphasize their structure (i.e. mode of generation) as entities, we shall 
use the former expressions. For example, when V, x, A(x) are entities, 
we may write their successor (V, x, A(x)) as "VxA(x)”; and when + , 
r, s are entities, we may write (+, r, s) as “r+s”. This way of designating 
entities correlated to objects of the formal system will make our state¬ 
ments about the entities read as our former statements about the objects 
of the formal system. 

Furthermore, in dealing with the generalized arithmetic, it is convenient 
to call the entity VxA(x) correlated to a formula (i.e. the entity (V, x, A(x))) 
simply a “formula”, the entity r+s a "term”, etc.; and the system of 
these entities “the formal system as a generalized arithmetic” in contrast 
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to the formal system as originally described, which we can distinguish 
when necessary as the ‘‘formal linguistic system”. 

Proofs and deductions will be represented by entities corresponding to 
them in their tree form (end § 24) rather than to them as finite sequences 
of formulas. Thus, to get the entity corresponding to a deduction in 
sequence form with a given analysis, let that deduction first be put into 
tree form. In tree form it has one of the three forms 


D, 


P 

d; 


P Q 

D, 


where D is a formula, and P and Q are deductions in tree form. In the 
generalized arithmetic, we construe these to be the entities (D), 
(D, P), (D, P, Q), respectively, at the same time of course construing D 
P, Q to be the entities correlated to the linguistic objects D, P, Q. (In 
particular, a formula D, and a deduction consisting of the single formula 
D, become different entities; the entity which the latter becomes is the 
successor with s = 0 of the entity which the former becomes.) 


Example 2. The deduction (6) of § 21, after being rewritten in tree 
form as in Example 1 § 24, becomes an entity which we can write as 
(8, (7, (1), (6)), (5, (3, (1), (2)), (4))) where the numbers 1 —8 abbreviate 
the formulas of (6) § 21 now considered as entities. 


This completes the correlation of entities to the significant formal 
linguistic objects. Distinct entities are correlated to distinct formal 
linguistic objects (except in the case of two inessentially differing proofs 
or deductions in sequence form which become the same in tree form). 
The proof of this, for the case of terms and formulas, depends on the 
uniqueness of the scopes of the operators in the terms and formulas as 
formal linguistic expressions (§ 17). 

In thus going over from the formal linguistic system to the generalized 
arithmetic, we have effected two changes, either of which could have been 
effected separately. The first of these is in the structure attributed to the 
formal objects. In the linguistic representation terms and formulas were 
finite sequences of formal symbols, in which the significant parts had 
to be recognized as subsequences, while in the generalized arithmetic 
they are constructed directly out of the significant parts by tne generalized 
successor operation. In the latter the analysis of expressions into their 
significant parts (including the use of parentheses) is transferred from 
the formal level to the exposition of the metamathematics, where we do 
not let it worry us. 

For example, consider (A) D (B) as a formula of the linguistic system. 
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where A and B are formulas (i.e. “A” and “B” are metamathematical 
letters designating formulas). We are then talking explicitly about a 
finite sequence of symbols, with the four parentheses shown being 
symbols of the sequence, and the sequences designated by “A” and “B” 
occurring in the indicated position in the sequence (§ 16). 

On the other hand, consider (A) D (B) as a formula in the system 
of entities, where A and B are likewise formulas. Then “(A) 3 (B)” 
means (D, A, B), which is the successor of the three entities D, A, B. 
The parentheses in “(A) D (B)” and the parentheses and commas in 
“(D, A, B)” are not formal objects, but only part of our intuitive no¬ 
tation for naming the entity under consideration. 

As the second change, we are now taking a different view of the role 
of the symbolism in presenting the formal system. In Chapter IV, the 
objects of the formal system were considered to be linguistic symbols or 
marks, and other linguistic objects constructed from such. In studying 
them we had in principle to watch the distinction between an object and 
a name or designation for the object, and between the mention of an ex¬ 
pression (as itself the object under consideration) and the use of it (in 
designating another object or expressing a proposition). This has been 
emphasized by Frege (1893 p. 4), Carnap (1934 pp. 153—160) and Quine 
(1940 pp. 23—37). To make a statement about an object, ordinarily a 
name for the object is used. (Another method, sometimes applicable, is to 
point to the object, or to use a linguistic construction which amounts to 
pointing to the object, i.e. calls attention to it instead of naming it.) We 
do not speak our friend, but we speak the name of our friend. It is not 
likely that we shall mistake our friend John for a sequence of four letters, 
but in metamathematics as treated in the preceding chapters we did 
have to be careful because we were discussing objects which were them¬ 
selves linguistic. 

One method of obtaining names for linguistic objects is to place them 
in quotation marks. The name of John is “John”, and the name of the 
name of John is “ “John” ”. The name of the name of John consists of four 
letters enclosed in one set of quotation marks; the second set of quotation 
marks used above is employed in naming that. 

A second method is the use of separate metamathematical letters 
and ‘expressions as names for the linguistic objects. 

It need not be strictly forbidden that a specimen of a linguistic object 
be used as name of the object; then the object has two uses, its use as 
the object of study, and its use as name of itself. In the latter use, it is 
called autonymous. 
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The method we used in the preceding chapters was a combination of 
the second and third methods. 

This problem of designation, which is troublesome to treat explicitly, 
is extraneous to the metamathematics as mathematics. The issue can be 
avoided by using only names of the formal objects, and not claiming to 
exhibit the objects themselves. We find it convenient to do this in the 
generalized arithmetic, considering now " D”, “V”, “VxA(x)” as names 
of certain objects (the names are the expressions inside the quotation 
marks), rather than as the objects themselves. We refrain from specifying 
what the objects are, other than that they belong to a domain of abstract 
objects arranged in a certain way in relation to one another, which we 
are calling entities (cf. § 8). The objects named could be formal symbols, 
formal expressions, etc. in the sense of Chapter IV, though we now leave 
this open as irrelevant for the metamathematics. (While we can thus 
avoid the problem of designation in our metamathematics, it would have 
to be faced in discussing the application of the metamathematics to a 
particular linguistic system.) 

By going over from the conception of the formal system in terms of 
formal symbols, treated as if they were marks on paper, to an abstract 
system of objects, our metamathematics (i.e. the study of the formal 
system) becomes a branch of pure number theory entirely on a par 
conceptually with the arithmetic of the natural numbers and similar 
mathematical disciplines. 

Remark 1 . The usual convention or practice in informal mathe¬ 
matical writing is to write all the symbols without quotes, so that when 
a symbol is being mentioned rather than used it is autonymous. In this 
book our practice is to employ quotes systematically in metamathe¬ 
matical passages to distinguish the mention of metamathematical 
expressions from their use in designating formal linguistic expressions, 
and elsewhere only for emphasis. 

§51. Recursive metamathematical definitions. The arithmetiza- 
tion of metamathematics will be completed in § 52 by mapping the 
generalized arithmetic into the ordinary arithmetic of the natural 
numbers. Our main objectives are to complete the proof of the lemma 
for Godel’s theorem, and to prove Theorem 31. Both results will follow 
from the result that a certain succession of number-theoretic predicates, 
obtained by the mapping from metamathematical predicates, are all 
primitive recursive. 

It is intuitively clear why these results hold, and would have to hold 
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for any formal system of like structure, when we consider the nature of 
the definitions of the metamathematical predicates in the generalized 
arithmetic. 

Because the entities are generated from zeros by successor operations, 
predicates and functions can be defined over the entities by recursion, 
We use this idea now, without stopping to state an accurate definition of 
‘primitive recursive’ for the generalized arithmetic. 

We give below a series of thirteen definitions of metamathematical 
predicates. Each definition is given by listing the cases in which the pred¬ 
icate is to be true. (A few clauses are starred for reference from § 52.) 

Each definition is either explicit (the predicate being defined not 
appearing in any of the defining clauses), or constitutes a primitive 
recursion (the value of the predicate for a given entity depending on 
values of itself for immediately preceding entities, or similarly with 
parameters), except that in Dn5 and Dn 11 the recursion is on two variables 
simultaneously (one a number variable in Dnll). For the discussion of 
a metamathematical definition not of this nature, cf. § 53. 

The reader should verify that the definitions do define the predicates 
named, as we know them from earlier sections of the book (Chapter IV 
and §§41,50). 

Definitions of metamathematical predicates for the formal 

NUMBER-THEORETIC SYSTEM AS A GENERALIZED ARITHMETIC 

Dnl. y is a numeral. (Abbreviation: 9?(y)-) 

1. y X 0. 

2. y x n' (i.e. y X (', n), cf. § 50), where n is a numeral. 

Dn2. y is a variable. (Abbreviation: 58(y)*) 

1. y X a. 

2. y X x, (i.e. y X („ x)), where x is a variable. 

Dn3. y is a term. (Abbreviation: %(y)-) 

1. y X 0. 

2 . y is a variable. 

3—5. y X r+s or r-s, where r and s are terms, y X r', where 
r is a term. 

Dn4. D is a formula. (Abbreviation: $(D).) 

1 . D X r=s, where r and s are terms. 

2 —5. DXADB, A&B or A V B, where A and B are formulas. 
D X “iA, where A is a formula. 
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6 —7. D X VxA{x) or 3xA(x), where x is a variable, and A(x) is a 
formula. 

Dn5. (t is a term, x is a variable, E is a term or formula, and) D comes 
from E by the substitution of t for (the free occurrences of) x. 
(Abbreviation: <3(D, E, t, x).) 

1. t is a term, x is a variable, E X x, and D X t. 

2—3. t is a term, x is a variable, E is 0 or a variable x x, and D X E. 

4—5. E is a term or formula, and E is (e 0 , e x ) and D is (e 0 , d x ), where 
e 0 X | and @(d x , e 1( t, x) (so t is a term, and x is a variable). 
E is a term or formula, and E is (e 0 , e 1; e 2 ) and D is (e 0 , d x , d 2 ), 
where e 0 X V or 3, <§>(d lt e 1( t, x) and <S(d 2 , e 2 , t, x). 

6 —7. t is a term, and E is (V, y, e 2 ) and D is (V, y, d 2 ), where y is a 
variable, e 2 is a formula, and either y X x and @(d 2 , e 2 , t, x), 
or y X x and D X E. Similarly for 3. 

Dn6. (E is a term or formula, x is a variable, and) E contains x free. 
(Abbreviation: (£$ (E, x).) 

1 . E is a term or formula, x is a variable, and @(E, E, 0, x). 

Dn7. (t is a term, x is a variable, E is a formula, and) t is free for x in E. 
(Abbreviation: $(t, x > E)). By recursion on E, With seven 
clauses corresponding to 1—7Dn4. For example: 

6 . t is a term, x is a variable, and E X VyA(y), where y is a 
variable, A(y) is a formula, and either E does not contain 
x free, or t is free for x in A(y) and t does not contain y free. 

Dn8. D is an axiom. (Abbreviation: 91(D).) 

1 —10. D X A 3 (B 3 A), where A and B are formulas (Axiom Schema 
la). Similarly for Axiom Schemata lb, 3, 4a, 4b, 5a, 5b, 6, 7, 8. 
(Note: We separate Axiom Schema 10 into two cases (Clauses 
11 and 13), according as the A(x) contains the x free or not. 
Similarly for Axiom Schema 11.) 

*11—12. There exists a t such that D X VxA(x) D A(t), where A(x) 
contains x free, t is free for x in A(x), and <3(A(t), A(x); t, x). 
(Note: That x is a variable, A(x) is a formula, and t is a term, 
is included here in the stipulation “t is free for x in A(x)”. 
Instead of presupposing the convention of § 18 by which A(t) 
stands for the result of substituting t for x in A(x), we make it 
explicit by“©(A(t), A(x), t, x)”.) Similarly for Axiom Schema 11. 

13—14. D X VxA 3 A, where x is a variable, A is a formula, and A 
does not contain x free. Similarly for Axiom Schema 11. 
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15. D X A(0) & Vx(A(x) 3 A(x')) 3 A(x), where A(x) is a formula, 
<S(A(0), A(x), 0, x) and @(A(x'), A(x), x', x). 

16—23. D X a'—b' 3 a=b (Axiom 14). Similarly for Axioms 15—21. 

Dn9. D is an immediate consequence of E. (Abbreviation: (£(D, E).) 

1—2. E X C 3 A(x) and D X C 3 VxA(x), where x is a variable, 
A(x) and C are formulas, and C does not contain x free (Rule 9). 
Similarly for Rule 12. 

DnlO. D is an immediate consequence of E and F. 

(Abbreviation: (£(D, E, F).) 

1. D and E are formulas, and F X E 3 D (Rule 2). 

Dnll. x is the numeral for the natural number x. 

(Abbreviation: 9?u(x, x).) 

1. x X 0 and x = 0. 

2. x X n' and x = n' , where 9?u(n, n). 

Dnl2. Y is a proof. (Abbreviation: ^$f(Y).) 

1. Y X (D), where D is an axiom. 

2. Y X (D, P), where P is a proof, and D is an immediate con¬ 
sequence of {P} 0 . 

3. Y X (D, P, Q), where P and Q are proofs, and D is an imme¬ 
diate consequence of {P} 0 and (Q} 0 . 

Dnl3. A (a) is a formula, x is a natural number, and Y is a proof of the 
formula A(x) (as a predicate of A {a), x, Y). 

(Abbreviation: $f(A(a), x, Y).) 

* 1. A(a) contains a free, ^Sf (Y), and there is an x such that 9iu(x, x) 
and ©({Y} 0 , A {a), x, a). 

2. A(a) does not contain a free, $f(Y) and {Y} 0 X A (a). 

Dn 13a. For each n distinct variables x t .x n and formula A(x x ,..., x„): 

Y is a proof of A(x x ,..., x n ). (Abbreviation: 

.x«.A(x x .x„) (* 1 » • • •» X n ,Y) or A (x x ,. . ., X n ,Y).) 

Similarly. 

§ 52. Gbdel numbering. We now complete the arithmetization 
of metamathematics by representing the generalized arithmetic within 
the arithmetic of the natural numbers. First, we correlate distinct odd 
numbers to the zero entities, thus: 

3&V- 1 V3=+ - ' 0 a t 

3 5 7 9 11 13 15 17 19 21 23 25 27 
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Then, whenever x 0 ,..., x, are entities to which respective numbers 
x 0 ,..., x, have already been correlated, we correlate to the successor 
entity (x 0 ,..,, x s ) the number »•,. .-Pg s (#18 § 45). 

By a mathematical induction corresponding to the definition of entity 
a natural number > 0 is thereby correlated to each entity. This number 
we call the Godel number of the entity, or say that it represents the entity 
(or formal linguistic object to which the entity in turn is correlated). 
Since only even numbers are correlated to successor entities (because 
p 0 = 2 and x 0 #0), and because a given positive integer has the form 
Po°‘ • • • 'Ps s ( x o> •••,*„> 0) for at most one s and x 0 ,..., x s , distinct 
numbers are correlated to distinct entities. 


Example 1. The Godel number of 3b(-i b=0), or as an entity 
(3,(„ a), K (-. („ a), 0))), is 2“-3* 27 -* 25 -5 29 3215 ' 3227 ^ 523 


Since x { < p % a -... -p* s for 0 <. i < s, entities x and y in the relation¬ 
ship x«< y are always represented by natural numbers x and y in the 
relationship x < y. (However x < y may hold for pairs of numbers x 
and y correlated to entities x and y without x -< y holding; e.g. 3 < 5 
but not D -< &.) 

If x is an entity of the successor form (x 0 ,..., x,) with s > i, and x 
is the Godel number of x, the Godel number of the predecessor {x} f is 
(*)« (#19 §45). 

When we piss from entities to their Godel numbers, a predicate or 
function of entities becomes a predicate or function of Godel numbers. 
A number-theoretic predicate or function obtained by extending the 
definition of the latter to all natural numbers we say corresponds to the 
original predicate. In particular, in the case of a predicate ^(x 1( ..., x„), 
we shall understand by the corresponding number-theoretic predicate 
P(x lt ...,*„) that one which is obtained by taking the value to be f 
whenever not all of x v ..., x n are Godel numbers; i.e. 

P(x v . ..,x n ) = {x v ..., x n are Godel numbers of entities x x ,..., x n , and 

$(x 1( ..., x„)}. 

Similarly when some of the variables of the original predicate already 
range over natural numbers (e.g. Dnll § 51). 


Lemma 19. For each of the predicates defined by Dnl — Dnl3, Dnl3a, 
the corresponding number-theoretic predicate is primitive recursive. 

Proof. To illustrate the method, let us treat Dn3, assuming Dn2 
already treated. 
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We can write Dn3 symbolically as follows. 

£(y) = y X 0 
VSS(y) 

(!) v [y x (+, {y} lf {y} 2 ) & $({y)i) & £({y} 2 )] 

v [y x (•, {y} x , {y} 2 ) & £({yh) & $({y} 2 )] 
V[yX(',{y} 1 )&2({y} 1 )]. 

The five disjunctive members correspond to the five clauses of Dn3 as given 
in §51. For the third clause we observe that ify X r+s, i.e. y X (+, r, s), 
for any entities rands, then r X {yh, s X {y} 2 . The fact that y X ( + , r > s ) 
for some entities r and s is then expressed by y X (+, {y} x , (y} 2 )- 
In (1) let us replace the zero entities +, •, 0 by their Godel numbers 

17, 19, 21, 23, the assumed predicate 93(y) by its corresponding number- 
theoretic predicate V(y), X by =, successor entities (x 0 ,...,x s ) by 
Po°'■ ■ ■'Ps s ’ predecessors {y}, : by (y),-, and write T(y) instead of %(y) 
for the predicate being defined. This leads formally to the following 
number-theoretic equivalence. 

T(y) s y=23 

V V(y) 

(2) V [y—2 17 -3 (y>1, 5 (i/)2 & X((y) a ) & T((y),)] 

V [y-2 19 -3 (3/)2 -5 (s ' )2 & r((y) x ) & r((y) a )] 

V [y=2 21, 3 (2/)l & r((y) x )]. 

Now (2) defines a predicate T{y) by course-of-values recursion in the 
arithmetic of the natural numbers, since (y)i < y for y 0; and by 
#G with ##2, 3, 14, 19, A, C, D and our hypothesis that V is primitive 
recursive, T(y) is primitive recursive (cf. Example 3 §46). 

It remains to prove that the predicate T(y) defined by (2) is the 
number-theoretic predicate corresponding to %(y ). For this purpose, we 
prove two propositions by course-of-values induction on y: 

(a) If T(y) (by (2)), then y is the Godel number of an entity y such that 

£(y) (by (i)). 

(b) If 2(y) (by (1)), and y is the Godel number of y, then T(y) (by (2)). 

Proofs, (a) By (2), T(y) is true only when one of the disjunctive 
members (or “clauses”) on the right of ( 2 ) is true, so we have five cases 
to treat. Case 2: V{y). Then, since V is the predicate corresponding to 93, 
y is the Godel number of a variable y, i.e. 93 (y); and by the corresponding 
clause of ( 1 ), 2(y). Case 3: y = 2 1 T -3 ( ^-5“' ) * & T((y) x ) & T((y) 2 ). Then 
(y)i> (y ) 2 < y; so from T((y) x ) & F((y) 2 ) by the hypothesis of the induction 
on y, (y) x and (y) 2 are Godel numbers of entities r and s such that 2(r) 
and X(s). Then y (= 2 17 • 3 (v)l • 5 ivh ) is the Godel number of ( + , r, s). 
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Calling the latter y, then r x {y} 1( s X {y} 2 ; so y x (+, {yh, {y} 2 ) & 
2({y)i) & X({y} 2 ), an d by the respective clause of (1), %(y). The proof 
of (b) is similar, with cases from the disjunctive members (or “clauses”) 
on the right of (1). 

The other definitions of predicates in the list Dnl — Dnl3, Dnl3a are 
handled similarly, except as noted below for the recursions on two 
variables and for the starred clauses. After translating (as from (1) to (2)), 
the number-theoretic predicate defined is true only for Godel numbers 
as arguments, since in each clause of the original definition each variable 
entity is required to satisfy an earlier predicate of the list, or is a suc¬ 
cessor of entities which are fixed or must satisfy such predicates or the 
predicate being defined. For example, we translate Clause lDn9 as 

*=2 s -3 Wl -5 w> & d= 2 3 -3 (£) '-5^_' 3W),,l ' 5<e)2 & V{(d) 2 x ) 
&F((e) 2 )&F((e) 1 )&CF((e) 1 ,(d) 2<1 ), 

where “( d) 21 ” abbreviates < ‘((^) 2 ) 1 ”- Note that CF(e, x) is not quite the 
number-theoretic predicate corresponding to (£g(E, x), since it is true when 
e or x is not a Godel number. But this does not matter here, since (e) 1 
occurs also in F((e) x ) and ( d) 21 also in V((d) 2 l ), corresponding to the 
stipulations in lDn9 that C is a formula and x is a variable. 

Dn5 and Dnl 1 are recursions (of a simple kind) on two variables. For 
Dn5, e.g., let T(z,t,x) =s {z = 2 d -3 c where S(d, e, t, x)}. Then T satisfies 
a course-of-values recursion on the one variable z, and S(d, e, t, x) = 
T(2 d -3 e , t,x). 

It remains to consider the starred clauses 11 and 12Dn8, and lDnl3 
and 1 Dnl3a. We translate HDn8.as 

(Et) l<d [d=& CF((d) 12 , (d) hl ) 

& Fit, (d) 1<v (d) h2 ) & S((d) 2 , (d) h2 , t, (i) lfl )], 
and use #E. The bound t<d is justified, since when A(x) contains 
x free, t -< A(t) -< VxA(x) DA(t) X D. We translate lDnl3 as 
CF{a, 25) & Pf[y) & (En) n<y [Nu{n, x) & S((y) 0 , a, n, 25)]. 

This completes the proof of Lemma 19. The gist of this proof is that 
the primitive recursions in the generalized arithmetic become course-of- 
values recursions in the ordinary arithmetic, since the Godel numbering 
preserves the order relationships although it destroys the relationships of 
immediate succession. It is necessary to verify that the range of each 
variable which we elected to introduce with a quantifier rather than as a 
function of the independent variables of the predicate being defined 
(e.g. t in HDn8 and x in !Dnl3) can be restricted. 
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Lemma 20. Under the Godel numbering of this section, the predicates 
A {a, b) and B(a, c) of Lemma 21 § 42 are primitive recursive. 

Proof. By Lemma 19, since we can express A (a, b) and B(a, c) in 
terms of the number-theoretic predicate Pf(a, x, y) corresponding to 
the predicate ^$f(A(d), x, Y) of Dnl3 thus, 

A {a, b) h= Pf(a, a, b), B(a, c) = Pf( 2 9 -3°, a, c). 

Lemma 21 § 42 follows now using Corollary Theorem 27 § 49. 

Theorem 31. For any given formula A (a) (cf. Dnl3), the predicate 
'A(x) is provable' (as predicate of x, where x is the numeral for x) is ex¬ 
pressible in the form (Ey)R(x, y) where R is primitive recursive ; i.e. given a 
formula A (a), a primitive recursive predicate R(x, y) can be found such that 

(Ey)R(x, y) = f-A(x). 

(Similarly for A(x x , ..., x„); cf. Dnl3a.) 

Proof. A formula is provable, if and only if there exists a proof of it. 
Let a be the Godel number of the particular formula A (a) of the hypothesis 
of the theorem, and set 

R(x, y) = Pf(a, x, y). 

(For A(x x , ..., x„), set R(x v ...,x n ,y) = Pf±(x v ...,x n , y).) 

Example 2. Let ©(E, t, x) x {the result of substituting t for x in 
E, if t is a term, x is a variable, and E is a term or formula; otherwise, E}; 
and let 9iu(x) X {the numeral x for the natural number x). These meta- 
mathematical functions ©(E, t, x) and 9iu(x) can be defined by recursion, 
similarly to the predicates ©(D, E, t, x) (Dn5) and 9tu(x, x) (Dnl 1); and 
the corresponding number-theoretic functions S(e, t, x) (= e, when e, t, x 
are not all Godel numbers) and Nu(x) are primitive recursive. 

*§ 53. Inductive and recursive definitions. The definitions of 
‘term’ and ‘formula’ were given originally in § 17 as inductive definitions. 
Other examples of inductive definitions are the definition of ‘natural 
number’ (§ 6), of ‘provable formula’ in the first version (§ 19), of ‘primitive 
recursive function’ if phrased similarly (§ 43), and of ‘entity’ (§ 50). The 
results of this chapter may be viewed in terms of the relationship between 
inductive and recursive definitions. We begin with a few remarks about 
inductive definitions generally. 

Inductive definitions occur in two different roles, and we call them 
fundamental and non-fundamental accordingly. To which category a 
given inductive definition belongs may vary with the context or theory in 
which it is being used. 
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For the generalized arithmetic, the definition of ‘entity’ is the 
fundamental inductive definition. It establishes the domain of objects for 
the arithmetic. An entity is thereafter understood to be given, when and 
only when its mode of generation under the inductive definition of 
‘entity’ is given. 

Then the non-fundamental inductive definitions, such as those of 
‘term’, ‘formula’ and 'provable formula’, apply to objects already known 
in their status as entities. These definitions each define a class of entities, 
i.e. a subclass of the entities. We can ask in the arithmetic whether or not 
a given entity belongs to the subclass; and we can associate with the 
subclass a predicate taking the value t for an entity belonging to the 
subclass and f for an entity not belonging. We can regard the non¬ 
fundamental inductive definitions as definitions of these predicates. 

Thus the fundamental inductive definition establishes the range 
of a variable, over which one may subsequently define predicates by non¬ 
fundamental inductive definitions (including as a special case the constant 
predicate t). 

The manner in which a non-fundamental inductive definition defines 
a predicate is the following. The direct clauses tell us certain objects for 
which the predicate takes the value t. The extremal clause says that those 
are the only objects for which the value is t, so that we can attribute the 
value f whenever we are able to see that the direct clauses do not require 
the value to be t. 

The direct clauses generally include basic clauses, each of which tells 
us outright (or under hypotheses involving only previously defined 
predicates) that the value is t for a certain object, and inductive clauses, 
each of which tells us that, if the value is t for certain objects (and possibly 
under hypotheses involving previously defined predicates), then the value 
is t for the object related to those in a given way. (If basic clauses are 
missing, then the predicate takes the value f for all arguments. If in¬ 
ductive clauses are missing, the definition is simply an explicit definition 
by cases.) 

Non-fundamental inductive definitions can also be used to define 
predicates of more than one variable. Such a definition sometimes has 
the form of an inductive definition of a class depending on a parameter 
(e.g. that of ‘ <’ § 6), and in general it can be considered as the inductive 
definition of a class of ordered «-tuples. 

Inductive definitions, both fundamental and non-fundamental, justify 
corresponding forms of ‘proof by mathematical induction’. Those cor¬ 
responding to the inductive definitions of ‘natural number’ and ‘entity’ 
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have already been mentioned (§§ 7 , 50 ). As another example, the principle 
of induction corresponding to the inductive definition of ‘provable for¬ 
mula’ is this: If every axiom has a certain property, and if whenever the 
premises for a formal inference have the property so does the conclusion, 
then every provable formula has the property. (This induction principle 
could have been invoked for the proof of Theorem 9 § 28 , instead of 
course-of-values induction on the length of a proof. Then Lemmas 12a 
and 12b themselves constitute the basis and induction steps of the proof.) 

In the same way, fundamental inductive definitions (under the con¬ 
vention that differently generated objects are distinct) justify‘definitions 
by induction’ or ‘recursive definitions’ of a function over the domain 
established by the inductive definition. (But a recursive procedure cor¬ 
responding to a non-fundamental inductive definition of a class which 
allows an object to be recognized as in the class by different successions 
of applications of the direct clauses may lead to more than one function 
value for such an object, e.g. “<p(A) = 0 if A is an axiom, 9(A) = 9(B) -f 1 
if A is an immediate consequence of B, and 9(A) = 9(B) + 9(C) + 1 
if A is an immediate consequence of B and C” does not define a single¬ 
valued function 9 from provable formulas to natural numbers.) 

Predicates can be introduced from recursively defined functions 
serving as their representing functions, or often as we have seen in § 51 
by recursive procedures directly. 

In recursions (of such kinds as we have been considering for the simple 
arithmetic in Chapter IX) the value of a function or predicate, say of 
one variable, for any given non-zero argument is determined from the 
values of the same for only arguments preceding the given argument in 
terms of the order of generation of the domain by the fundamental 
inductive definition. This has the consequence that we can prove by a 
corresponding induction that a recursively defined predicate takes the 
value t or f for every argument. Thus the law of the excluded middle is 
proved intuitionistically to apply to every proposition taken as value of 
a recursively defined predicate. 

This is not in general so for an inductively defined predicate, as the use 
of the extremal clause to assign the value f whenever the direct clauses 
do not assign the value t may leave us without the knowledge of effective 
meafls to determine which is the value for any given argument (s). 

In a special case it is so, namely (e.g. for the inductive definition of a 
class) when the order in which the inductive clauses introduce members of 
the class agrees with the order of generation of the objects under the fun¬ 
damental inductive definition. It is inductive definitions of this sort 
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which we can recast as recursive definitions, as we did for those of ‘term’ 
and ‘formula’ in §51. 

An inductive definition not of this sort is that of ‘provable formula’ 
in the first version (§ 19). In the second version, the definition is set apart 
from the metamathematical definitions of § 51 by the fact that an existen¬ 
tial quantifier “there exists a proof Y” is used without a bound being 
known for its variable Y (in contrast to 11 — 12 Dn 8 and lDnl3). (Cf. § 30.) 
From this second version, we obtain as the corresponding number-theo¬ 
retic predicate (Ey)[Pf(y) & {y) 0 =d] (cf. Dnl2), which is of the form 
(. Ey)R(d , y) where R is primitive recursive. 

When the form of an inductive definition (with elementary direct 
clauses) is specified in a natural way, the predicates P(x v ..., x n ) 
definable by use of inductive definitions in the natural number arithmetic 
are exactly those expressible in the form ( Ey)R(x v ..., x n , y) with R 
primitive recursive. The proof can be given by an extension of the above 
methods, as was suggested in Kleene 1943 pp. 66—67; or by another 
method, indicated in Kleene 1944 * p. 48. 
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GENERAL RECURSIVE FUNCTIONS 

§ 54. Formal calculation of primitive recursive functions. 

Each of the schemata (I) — (V) of § 43 considered intuitively is an 
operation defining a function 9 from zero or more given functions. 
Actually we stated the schemata by means of equations. 

Let us review the manner in which the equations define the function 

9 , to see whether we cannot analyze our use of them in determining 
particular values of 9 into formal operations. 

Example 1 . Let x be a given function, two of whose values are: 
1. x(0,4) = 7. 2. xU>7) = 7. 

Let 9 be introduced by Schema (Va) with q = 4, thus: 

3 . 9 ( 0 ) = 4 . 4 . 9 (/) = x(y, 9 (y))- 

In § 43 we convinced ourselves that, for any number y, the recursion 
equations for 9 determine the corresponding value 9 (y) of 9 , if the values 
of x are already determined. In particular, with the two values of x just 
given, the reasoning of § 43 tells us that 9 ( 2 ) = 7. We now ask: What 
sorts of formal inferences will enable us to deduce the equation “ 9 ( 2 ) = 7” 
from Equations 1 — 4 ? 

Substituting " 0 ” for “y” in Equation 4 : 

5- <P(1) = X(0,9(0)). 

Replacing " 9 ( 0 )” in the right member of Equation 5 by ”4” from 
Equation 3: 

6 - 9(0 = X(°. 4). 

Four more steps of these two sorts complete the deduction: 

7. 9 ( 1 ) = 7 — Replacement, 6, 1 . 

8 . 9 ( 2 ) =,x(l, 9(0) — Substitution, 4. 

9. 9 ( 2 ) = ^(1,7) — Replacement, 8 , 7 . 

10 . 9 ( 2 ) — 7 — Replacement, 9, 2 . 

Thus a substitution and a replacement operation suffice for the de- 
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duction from the given equations 1 —4 of the equation “<p(2) = 7” 
which states that the value of 9 for the argument 2 is 7. Moreover, 
quite evidently, no succession of these two sorts of inferences can lead 
from Equations 1 — 4 to any other equation whose left member is 
“<p( 2 )” and whose right member is a numeral. 

Example 2 . Suppose now more particularly that x is the constant 
function {C\) defined by the equation 

2 . x(y > z ) = 7, 

while 9 is defined from x as in Example 1 . (Then 9 is primitive recursive, 
with x* 9 as a primitive recursive description.) Now we can deduce 
Equations 1 and 2 from Equation —2 by substitution, as follows: 

— 1 . x(0> z ) = 7 — Subst., —2. 0. x0> z ) = 7 — Subst., —2. 

1 . x(0, 4) = 7 — Subst., - 1 . 2 . x(l,7) = 7 — Subst., 0 . 

Combining these two deductions with that of Example 1 , we obtain 
a deduction of ” 9 ( 2 ) = 7" from the three equations —2, 3, 4 defining 9 ab 
initio. 

In these examples we have been considering questions of a formal 
kind, without having explicitly set up a formal system in advance. We 
shall now establish a suitable formal system, and make our discussion 
rigorous as a metamathematical discussion referring to this system. 

The new formal system we call the formalism (or formal system) of 
recursive functions. We describe it now in the linguistic manner, and 
later (§ 56) as a generalized arithmetic. 

The formal symbols of the system are as follows: = (equals), '(successor), 
0 (zero), a, b,c, .... a v <j 2 , ... (variables for natural numbers), f, g, h, 
.... fi.fi, ... (function letters, i.e. symbols for unspecified number- 
theoretic functions), (,) (parentheses), and , (comma). A (potentially) 
infinite list of variables and of function letters are supposed to be given. 

We call f, g, h, ..., f, f 2 , ... ‘‘function letters” rather than “function 
symbols” here to distinguish them from '. Also the name is appropriate, 
because they will have a role similar to that of the predicate letters in the 
pure predicate calculus, i.e. they are to be interpreted as expressing dif¬ 
ferent functions at different times, but there is no postulated rule of sub¬ 
stitution for them. In discussions in which f, g, h, etc. express fixed 
functions, and are not being treated differently from e.g. we may 

call them “function symbols”. 

The formal expressions 0 , O', 0 ”, ... we call numerals. As before 
(§ 41), we abbreviate them respectively by “0”, "1”, "2”, ...; and we 
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continue to use the convention whereby "x”, “y”, etc. designate the 
numerals for the natural numbers designated respectively by “x”, “y”, 
etc. 

The terms are 0, the variables, and expressions of the form r' where r 
is a term, or f(r 1 , ..., r n ) where f is a function letter and r 1( ..., r n are 
terms in > 0, omitting the parentheses when n — 0). 

A formal expression r=s where r and s are terms is an equation. 
The equations are the only “formulas” for this system. By a system 
of equations we mean a finite sequence e 0 , ..., e, of equations (not empty, 
unless otherwise stated). 

No axioms will be provided; and we shall define only ‘deducibility’ 
but not ‘provability’. 

The rules of inference shall be a one-premise substitution rule R1 
and a two-premise replacement rule R2, as follows. 

R1: to pass from an equation d containing a variable y to the equation 
which results from d by substituting a numeral y for y. 

R2: to pass from an equation r=s containing no variables (the major 
premise) and an equation h(z 1 , ...,z p )—z where h is a function 
letter and z v .... z P , z are numerals (the minor premise) to the 
equation which results from r=s by replacing an occurrence of 
h(z 1( ..., z P ) in s (or several such occurrences simultaneously) by z. 

A deduction of an equation e (the endequation of the deduction) from a 
system (or set, possibly infinite) E of equations is to be one in tree form 
(end § 24); i.e. it shall have one of the three forms 

c where c is one of the equations of E, 

W where W is a deduction from E, and c is an immediate consequence 
c by R1 of the endequation of W, or 

WX where W and X are deductions from E, and c is an immediate 
c consequence by R2 of the endequations of W and X respectively. 

If there is a deduction of e from E, then e is deducible from E (in symbols, 
E h e). 

Example 2 (continued). Translating Equations —2 to 10 into the 
new formalism (using the formal function letters f, h for the intuitive 
function letters “cp”, “x”> and b, c for the intuitive number variables 
“y”, ‘V’), and going over from the sequence to the tree form, we obtain 
the following figure. 
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4 2 . m=h{b, f(b)) 

5. f(\)=h(0,f(0)) 3. /'( 0 ) —4 -2 v h(b,c)=7 

(a) j — 1 . h(0, c)=7 

4 v f(b')=h(b,f(b)) 6 . f(\)=h(0,4) l./i(0,4)=7 -2 2 .h(b,c)=7 

8 . /-(2)=/i(l,/“(I)) _ 7. AO-7 0- 6(1, c) =7 

9. f(2)=h(\,7) 2. h(l, 7) =7 

10. f{2)—7 

This is a deduction of the equation f{2)—7 from the system of equations 

A(M= 7. } (b,) 

/■(°)= 4 . ) (b) 

m=h(b,m).) (bs) 

Example 1 (continued). Consider the part of the tree (a) without the 

(occurrences of) equations 0, call it (a 2 ). This is a deduction 

of f(2)=7 from h{ 0, 4)=7, A(l,7) =7 and (b 2 ). 

Remark 1 . One may ask : Why take the trouble of translating ? The 
reasons are of course the same as those which dictated our use of a special 
symbolism in the number-theoretic formal system of Chapter IV, distinct 
from the symbolism of intuitive number theory. Recapitulating: The 
translation now from " 9 ”, “x”, “y”, “z” to f, h, b, c is called for under 
the linguistic conception of a formalism, because the formal symbols must 
be from a fixed preassigned list of symbols considered in the metamathe¬ 
matics as mere marks. To use these autonymously (§§50, 16) without 
risk of confusion, most of them should be from a special alphabet. Under 
the conception of a formalism as a generalized arithmetic, we may if 
we wish consider that f is (i.e. that "f” is a name for) “ 9 ” (but then it is 
a name for “ 9 ” only, and not sometimes for “<]/’, etc.). The “f” used e.g. 
in the definition of 'term', on the other hand, is a name for an unspecified 
one of f, g, h, etc. (and if we consider that “f” is a name for ” 9 ”, "g” for 
"<j/\ “h” for “x”, etc., then "f” is a name for an unspecified one of 
" 9 ”, "4 “x”> etc.). The net result of translating is to enable us to talk 
clearly about the symbols as distinguished from the functions, numbers, 
etc. without going to the trouble of continually using quotation marks and 
related devices (such as Quine’s "corners” 1940 ). 

Suppose that a function 9 of n variables has been defined intuitively 
from l (;> 0) functions ..., tjq of m v .. .,tn l variables, respectively. 
Before we can discuss whether a given system E of equations "defines” 
9 from (Jq, ..., i n the formalism, we must say which function letters 
f, gi, .... g[ are to express 9 , (fq, .... respectively. 
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It is convenient to employ conventions which make these letters rec¬ 
ognizable from E itself. It suffices to consider systems E in which the 
first (leftmost) symbol of the last equation is a function letter f; and we 
call this the principal function letter of E, and use it to express 9 . The 
distinct function letters which occur in right members of equations of 
E but not in left members we call the given function letters of E. It suffices 
to consider systems E in which there are l of these; and we use them 
g v ..., gi in the order of their occurrence in the preassigned list of function 
letters to express respectively. It suffices to use systems 

E in which f, g x , ..., g ( occur only in terms formed with n, m v .. .,m x 
arguments, respectively (or we could for the present purpose count as 
distinct a function letter with each number of arguments, as we did 
predicate letters in §31); we then call the other function letters 
(if any) occurring in equations of E the auxiliary function letters of E. 

When l > 0, we shall need to have available as ‘‘assumption equations" 
not only the system E which we associate with the scheme defining 9 
from ^ 1( but also equations giving the values of the functions 

..., Let denote the set of the equations 

Si(yv • • •. y mj) =y Where .... y mj ) = y, for j = 1, ..., /and all tnf 

tuples y v ..., y mj of natural numbers (y v ..., y mj , y being the numerals 
for the numbers y v ..., y mj , y). This set of equations is infinite, when 
l > 0 (unless m 1 + ... + m x = 0 ); empty, when l — 0. 

We bring these ideas together in the following metamathematical 
definition. A system E of equations defines 9 recursively in (or from) 
<!»!, ...,tjh> if f°r each w-tuple x v of natural numbers: 

E b f(x x , ...,x„)=x, where f is the principal function letter 
of E, g x , ..., gi are the given function letters of E in order of their 
occurrence in the preassigned list of function letters, and x is a numeral, 
if and only if cp(x v ..., x n ) = x. 

In other words, E defines 9 recursively from ij>i» . if (for f, 

g x , ...,g ; as described): Ejj;"^,E h f(x lf ..x„)=x where 
X x , ..., x n , x are numerals, if (completeness property) and only if (con¬ 
sistency property ) f(x x , ..., x n )=x £ EjP. 

Examples 1 and 2 (continued). The principal function letter of (b) 
is f; h is ah auxiliary function letter; and quite evidently (b) defines 
recursively the function 9 (where 9 (y) = 4 if y = 0 , and 9 (y) =7 if 
y > 0). The system (bj) defines y (= C\) recursively; and (b 2 ) defines 
9 recursively from x> with h as the given function letter. 
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Theorem II. If 9 is primitive recursive in ijq, ..then there is a 
system E of equations which defines 9 recursively from (p v ..., 

It is quite evident that we obtain such a system E by translating the 
schema applications for any primitive recursive derivation <p 1( ..., y k 
of 9 from t{q, ..into the formalism, if we choose the function letters 
suitably, and if we first arrange (if necessary) to have each used in some 
schema application (since our conventions provide that t|q, be 

expressed by function letters g x , ..., g ( , respectively, all occurring in E). 
However we shall give the metamathematical analysis in detail, with five 
lemmas, to lay the basis for brief treatment of similar matters later. 

By the principal branch of a deduction, we mean the branch which, 
traced upward from the endequation, contains the major premise at each 
application of R 2 . The equation which stands at the top of the principal 
branch we call the principal equation. The deductions (occurring as parts 
of the given deduction) of the minor premises for the applications of R2 
along the principal branch we call the contributory deductions. 

Example 2 (continued). The principal branch of (a), read down¬ 
wards, consists of the equations numbered 4 V 8 , 9, 10. The principal 
equation is f(b')—h(b, f(b)). The two contributory deductions are the 
trees ending with 7 and with 2. 

The principal branch of a deduction of an equation of the form 
f(Xj, ...,x n )—x where f is a function letter and x v ...,x„,x are 
numerals, read downwards, consists of zero or more applications of Rl, 
followed by zero or more applications of R 2 . The applications of Rl 
substitute respective numerals for the variables of the principal equation, 
until the left member becomes f(x x , ...,x„). The applications of R2 
replace parts h (z v ..., z p ) in the right member (originally present or 
resulting in the course of these replacements) by respective numerals z, 
until the right member becomes x. 

By the identical schema, we mean: 

<p(*i» • x„) = <!>(*!, .. x n ). 

Lemma Ha. If 9 is an immediate dependent of ijq, ..by one of 
Schemata (I) — (V) or the identical schema, then the system E of equations 
obtained by translating the informal equations of the schema application into 
the formalism {with any appropriate choice of the function letters) defines 
9 recursively from <J t v ..., <bi- 

Proof of Lemma Ha. The proofs parallel the informal reasoning 
by which we recognized that the schema applications define the functions. 
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Schema (Vb). The E is of the form f( 0 , x 2 , ..x„)=g(x 2 , ..x n ), 
f(y', x 2 , ..., x n )=h(y, f(y, x 2 , ..x B ), x 2 , ..., x B ). Choose any n— 1 
numbers x 2 , ...,x n . We use induction on x v Basis: x x — 0. Then 
9 (* x> = <|/(*„ Now Ej|,E b f(x 1( ..x n )=x for 

x — <p(x v . .., x n ), using the first equation of E as the principal equation,, 
and the equation g(x 2 , ..x„)~x from E|* as a minor premise for R2. 
Moreover, to deduce f(x x , ...» x„) = x for any numeral x, we must use 
the same principal equation and the same substitutions of numerals by 
R1 (apart from their order), since from no other equation of E^£, E 
and by no other substitutions can we obtain an equation with f (x x , ..., x„} 
(for x 1 — 0 ) as left member. Then the replacement step by R 2 is also 
uniquely determined. So Ejj(£, E b f(x x , ..., x n )=x with x a numeral 
only for x = y{x lt ..., x n ). Induction step: x 1 = y'. Similarly. 

Lemma lib. Let D be a set of equations (.finite or infinite), F be a 
system, of equations whose left members contain no function letters which 
occur in {equations of) D, and g be a function letter occurring in D. Then 
D, F b g(.Vi, • • ■,y m )=y where y v ...,y m ,y are numerals, only if 

D h s(yi> • -•>y m )=y- 

Proof of Lemma lib. Consider any deduction from D, F of an 
equation of the described form g(y x , ..., y m )=y. We prove by course- 
of-values induction on the height t of this deduction that only equations 
of D are used in it as assumption equations (i.e. occur in it at the tops of 
branches). The principal equation is an equation of D, since its first symbol 
is g, which occurs in D and hence not in the left member of an equation 
of F. Each contributory deduction is of height < t, and terminates in 
an equation h(z x , ..., z v )—z, where h is a function letter occurring on 
the right side of the principal equation, and thus in D. Therefore, by the 
hypothesis of the induction, the contributory deductions use only as¬ 
sumption equations from D. 

Lemma lie. Let D and F be as in Lemma lib. Let G be the set of the 
equations of the form g {y v ..., y m ) —y, where g is a function letter oc¬ 
curring in both D and F and y v ..., y m , y are numerals, which are deducible 
from D. Let f be a function letter not occurring in D. Then 
D, F b f(x x , ..x n )=x where x x , ...,x n ,x are numerals, only if 
G, F b f(x x , ...,x n )=x. 

Proof of Lemma He, by course-of-values induction on the height 
t of the given deduction of f(x x , .... x n )—x from D, F. The principal 
equation is an equation of F, because its first symbol is f, which does not 
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occur in any equation of D. Any minor premise along the principal 
branch is of the form h(z 1; ...,z p )=z where h occurs in the right 
member of the principal equation and hence in F. If h occurs in D, 
then using Lemma lib h (z v ...,z p )=z is an equation of the form 
g (y v .. .,y m )= y which is deducible from D, i.e. h(z x , ..., z P )=z £ G. 
If h does not occur in D, then by the hypothesis of the induction 
on t, h (z v ..., z P )=z is deducible from G, F. 


Lemma lid. ( 1 ) Let f 1( ..., i k be distinct function letters in order of 
occurrence in the given list of function letters. ( 2 ) Let <p x = ..., 9 t = t|q. 

( 3 ) For 1 = 1 + 1 , ..., k (k > l), let E< define 9,- recursively from 9 ,- iv 
..., 9 ii q . (?!i 0; jn, .. jiq { < i), with f t as the principal function letter, 
and ij iv .... ij iq . as the given function letters. ( 4 ) Let the auxiliary function 


letters of E i (if any) be distinct from those of each Ey for j + i and from 
f i» • • • > fjf Then for i = 1 , ..., k: E ^ 1 , E; +1 ... E fc (- fy(x x , . .. , Xnf)—x 

where x x , .. x n ., x are numerals, if and only if 


U( x l’ ■ ..,x n ,) = x 8 Ef*'. 


Example £ (concluded). Let k = 2; l = 0 ; 9 X , 9 2 be the primitive re¬ 
cursive description y, 9; and E x , E 2 be (b x ), (b 2 ) interchanging f and h. 


Example 3 . Let 9 X) ..9*. (with k = 9, 1 = 3) be the primitive 
recursive derivation £, vj, 0, XJ\, 0 X , U\, C \, 9 of Examples 1 §§ 44 and 45, 
and E t (i = 4, .... 9) be the equation (c,): 


f i (a,c,b)=a, (c 4 ) 

f 5 (a,c,b)=f 3 (fi{a,c,b)), (c 5 ) 

f 6 (a,c, b)=b, (c e ) 

f-M, c, b)=f i (f 6 {a, c, b), f 5 (a, c, b)), (c 7 ) 

f s (a,c,b)=2, (c 8 ) 

f 9 (a, c, b)=f l (f i (a,c,b),f 1 (a,c,b), f s (a,c,b)). (c 9 ) 


Proof of Lemma lid. We easily see, by general properties of h that 
E J+1 .. .E k h f,(x x , ..., x n ,)=x, if f,(x x , ..x„.)=x 6 Ef*. We 
prove the converse by induction on k. Basis: k = l. Then E J+X ... E fc 
is empty. The conclusion follows by the constitution of E^ 1 (noting 
that it contains no premise for R 1 and no major premise for R2). 
Induction step: k > l. By hypothesis of the induction, only those 
equations of the form f,(x x , .. ., x ni )=x for i <k which £ Ef* are 

Ej.^. By Hypothesis (3) for i = k 


deducible from E^ 1 '"^, E i+1 
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(and writing >” for >*”, for “f” for etc.), only those 

equations of the form f(x 1 , ..., x„)=x which S are deducible from 

9 j ...9.- 

E,. 1 f. 9 ,E k ; and by Lemma lie no others become deducible 
V 


when 

h /j 


is 


replaced in 

p'h-'h 

*V-V 


the list of assumption equations by 

Ej +1 ... E^j. 


Lemma lie. Let cp 1( ..., 9 k be a finite sequence of functions such that 
<p k is 9 and for each i (i — 1, ..., k), either (A) 9,- is one of the functions 
<p lt ..., or (B) 9,- is defined recursively by a system E,- of equations from 
<?i iv • • •, <p/ i9f (qi > 0; jil, .. < i). Then there is a system E of 

equations which defines 9 recursively from <\) v ..., 


Proof of Lemma lie. If it is not already the case that each of the 
<J/s is introduced under (A) as one of the 9’s and is thereafter used under 
(B) as one of the 9 j iv .. 9 ^ for some 9,-, we can make it so (increasing k) 
by introducing some applications of the identical schema (cf. Lemma Ila). 
Then, by rearranging and renumbering the 9’s and E,'s and changing the 
function letters in the latter (if necessary), we can bring about the situation 
described in Lemma lid, with k > l, 9* = 9, and with f x , ..., f ; as the 
given function letters of E i+1 ... E*. Let E be E J+1 . .. E*. 


Proof of Theorem II. By Lemma Ila and the hypothesis of the 
theorem, the hypotheses of Lemma He are satisfied. 


§ 55. General recursive functions. The schemata (I) —(V) are 
not the only schemes of definition of a number-theoretic function, ab 
initio or from other number-theoretic functions, which can be expressed 
by systems of equations, using in the equations only function letters, 
number variables and numerals. 

Let us consider other examples, calling them all “recursions”. We keep 
the equations in the informal language for the time being; and to keep 
them of the sort described now, we eliminate certain other modes of 
expression which were used in Chapter IX, e.g. II (#B), W v <z (# E )> 

y<z 

cases. (#F). 

Thus (a) of Example 1 § 46 we can write now 


(a) 


7 r( 0 , y) = 1 , 

«(*', y) = (y + 9(*))**(*.y). 
9 (y) = «(y. y) 
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(defining the auxiliary function n as well as 9), while (a) of Example 2 
§ 46 is already in the form under consideration. We showed in § 46 that 
these course-of-values recursions are reducible to primitive recursion, i.e. 
the same function can be defined by a series of applications of Schemata 

(I) - (V). 

As another very simple example, consider the recursion 


(b) 


r 9(0, z ) = z , 
l <p(y'. Z ) = <p(y, a ( y , z )). 


This is not primitive, because the z, instead of being held fixed as a 
parameter, has a(y, z) substituted for it in the induction step of the 
definition. This recursion too can be reduced to primitive recursion. 
Expanding (b) for y = 0, 1,2, ... (as we expanded (1) in § 43 ), we find 
that the value <p(y, z) is 

<r(0 , ct(1, ct(2 , ... o(y— 3 , o(y—2, a(y—l,z )))...))). 

Consider the sequence of the numbers z, a{y—\,z), <j(y—2, o(y—1, z)), 
.... <j(0, <j(l, o( 2, ... o(y— 3 , <j(y—2, a(y— 1, z))). ..))), which occur in 
building up this value from the inside instead of as (b) gives it to us. 
These are the values for u = 0 , 1, 2, ..., y of the function \x(u, y, z) defined 
by the primitive recursion 

, b v f n(0, y, z) = z, 

1 1 l |i(«', y, z) = a(y—u', yi{u, y, z)). 

Since the value for u = y is the same as the value <p(y, z), 

(b 2 ) <P [y, z) = p(y, y, z ); 

as can also be seen by using induction on u to prove that 

(c) y', z) = n(u, y, a(y, z)), 

and thence that the 9 defined by (b x ) and (b 2 ) satisfies (b). 

In a similar manner, Peter (1934, 1935a) showed that every recursion 
(called “nested”) in which 9(0, z) is a given function of z, and 9(y', z) 
is expressed explicitly in terms of y, z, given functions (and constants), 
and 9(y, t) as a function of t, is reducible to primitive recursion. 

Are there recursions which are not reducible to primitive recursion; and 
in particular can recursion be used to define a function which is not 
primitive recursive? 

This question arose from a conjecture of Hilbert 1926 on the continuum 
problem, and was answered by Ackermann 1928. Let £„(£>, a ) = a-\-b, 
£ x (6, a) = a-b, E, 2 (b, a) — a b ; and let this series of functions be extended 
by successive primitive recursions of the form £ n .(0, a) = a, £„-(&', a) = 

{Z,n’{b, a), a) (n > 2), so that e.g. £ 3 (£>, a) — a a '' a with b exponents. 
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Now consider 5 „(b , a) as a function 5 («, a) of all three variables. Let a be 

the primitive recursive function defined thus, 


(d) 


a(n, a) 


0 if n = 0, 
1 if n = 1, 
a otherwise 


Then the following recursion defines £(n, b, a), 


(e) 


5(0, 6, a) = «+&. 

< 5(»', 0, a) = a(«, a), 

5(n', 6', a) = 5 (», 5 « 6, «). «)• 


This is an example of a “double recursion”, i.e. one on two variables 
simultaneously. If the function £(», b, a) defined by (e) were primitive 
recursive, then the function 5(«) of one variable defined explicitly from it 
thus, 


(f) 


5(«) = 5(a, a, a), 


would also be primitive recursive. Ackermann’s investigation shows 
that ^(a) grows faster with increasing a than any primitive recursive 
function of a (just as 2“ grows faster than any polynomial in a), i.e. given 
any primitive recursive function <p(a), a natural number c can be found 
such that 5 ( fl ) > ?(#) for all a > c. Thus 5 («), and hence also 5 («, b, a) 
(since 5(a) comes from it by the explicit definition (f)), are not primitive 
recursive. This example was simplified by Peter 1935 (cf. also Hilbert- 
Bernays 1934 pp. 330 ff.) and Raphael Robinson 1948. 

A different method was followed by Peter 1935 in constructing another 
example. The class of the initial functions definable by Schemata (I) — 
(III) is enumerable. Then the class of the primitive recursive functions 
definable using Schema (IV) or (V) just once is enumerable, since the 
m +\-tuples <j>, xi, • • x m for (IV) or the pairs <J>, x (or q, x) for (V) 
formed from an enumerable class are enumerable (§1). Then the primitive 
recursive functions definable using Schema (IV) or (V) a second time are 
enumerable; and so on. Thus the class of all the primitive recursive 
functions is enumerable, as we could also see by enumerating the systems 
E for Theorem II § 54 . In particular, the primitive recursive functions of 
one variable are enumerable. Hence by Cantor’s diagonal method (§ 2) 
they cannot comprise all the number-theoretic functions of one variable; 
and if 

?o(«). 9 i ( a )' 9 z ( a )> • • ■ 

is any enumeration of them allowing repetitions (i.e. any infinite list of 
them in which each occurs at least once), then <p a (a) + l is a number- 
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theoretic function of one variable not in the enumeration, and so not 
primitive recursive. The enumerating function 9(», a) such that 
9 (n, a) — 9 n (a) is a function of two variables which is not primitive re¬ 
cursive, since 9 «,(a)+l = 9(a, «) + l. This of course only establishes 
that number-theoretic functions 9 a (a) + l and 9 (n, a) can be found 
which are not primitive recursive. What Peter did was to show that, for 
a suitable enumeration (with repetitions) of the primitive recursive 
functions of one variable, the enumerating function can be defined by a 
double recursion (besides applications of Schemata (I) — (V)). 

Example 1. Do double recursions lead to any predicates which are 
not primitive recursive? Yes, for 1 — 9(a, a) takes only 0 and 1 as values, 
and cannot occur in the above enumeration, so it is the representing 
function of a predicate not primitive recursive. (Skolem 1944.) 

Peter 1936 studies £-fold recursions for every positive integer k. These 
comprise primitive recursions for k = 1, double recursions for k — 2 , 
and so on. She shows that, for each successive k, new functions are ob¬ 
tained. Functions definable using (besides explicit definition) recursions 
up to order k she calls “^-recursive”. She shows that every 2 -recursive 
function is definable by a single double recursion of the form 


r 9(0, b) = 9(», 0) = 1, 

^ l ?(»', V) = a(», b, 9(», P(«, b, 9(»', b))), 9(»', b)) 

besides applications of Schemata (I) — (V); and similarly (with a scheme 
reducing to (g) for k = 2) for each k > 2. 


Example 2. To settle a point raised in § 45, suppose 9 is 3 -recursive 
but not 2-recursive, and <J> is 2-recursive but not 1-recursive, i.e. not 
primitive recursive. Then “if <{/ is primitive recursive, then 9 is primitive 
recursive” is vacuously true, but "9 is primitive recursive in <J/’ is false, 
since that would make 9 2-recursive. 


These subjects are treated in Peter’s monograph 1951 (not available 
during the writing of the present book). 

It is not to be expected that the &-fold recursions with finite k exhaust 
the possibilities for defining new functions by recursion. In 1950 Peter 
uses “transfinite recursions” (first employed by Ackermann 1940) to 
define new functions. 

This brings us to the problem, whether we can characterize in any 
exact way the notion of any “recursion”, or the class of all “recursive 
functions”. 

The examples (I) — (V), (a), (b), (e) (and others cited) of schemes of 



274 


GENERAL RECURSIVE FUNCTIONS 


CH. XI 


definition of a function which we have thus far agreed to call "recursions” 
possess two features: (i) They are expressed by equations in the manner 
which we analyzed formally (for (I) — (V) particularly) in § 54 . (ii) They 
are definitions by mathematical induction, in one form or another, 
except in the trivial case w r hen they are explicit definitions. 

The characterization of all "recursive functions” was accomplished in 
the definition of 'general recursive function’ by Godel 1934, who built 
on a suggestion of Herbrand. This definition succeeds by a bold general¬ 
ization, which consists in choosing Feature (i) by itself as the definition. 

We say then that a function 9 is general recursive, if there is a system 
E of equations which defines it recursively (§ 54 , with l — 0 ). 

This choice may seem unexpected, since the word "recursive” has its 
root in the verb "recur”, and mathematical induction is our method for 
handling recurrent processes. The meaning of the choice is not that 
Feature (ii) will be absent from any particular recursion, but that it 
is transferred out of the definition itself to the application of the definition. 
To show by finitary means that a given scheme has Feature (i), except 
in trivial situations, one will presumably have to make use of mathemat¬ 
ical induction somehow. But in defining the totality of general recursive 
functions, we forego the attempt to characterize in advance in what form 
the intuitive principle of induction must manifest itself. (By Godel’s 
theorem § 42 we know that the attempt at such a characterization by 
the formal number-theoretic system is incomplete.) 

In stating the Herbrand-Godel definition of general recursive function 
exactly, there is some latitude as to the details of the formalization, so 
that versions of the definition can be given which are equivalent to 
Godel’s but a bit simpler (cf. Kleene 1936, and 1943 §8). The present 
version is that of Kleene 1943, except for inconsequential changes in Rl 
and R2 which simplify § 56 slightly, and the inclusion of functions of 0 
variables in the treatment. (To relate the present treatment to Kleene 
1943, we note: (1) The inclusion of functions of 0 variables does not alter 
the notion of general recursiveness for functions of n > 0 variables. For 
one can show that, if an auxiliary function letter h occurs as a term 
with 0 arguments in the assumption equations, all occurrences of this 
term may be changed to k(c), where k is a new function letter and c a 
new variable, without altering the class of the deducible equations 
containing only the principal function letter. After this: ( 2 ) One can 
show in a few lines that exactly the same equations of the form 
f(Xj, ..., x„)=x, where f is a function letter and x v .... x„, x are nu¬ 
merals, are deducible from given assumption equations by the present 
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R1 and R 2 as by the R1 and R 2 of 1943; or with only a little more 
trouble one can carry out the treatment of §§ 54 and 56 with the R1 
and R 2 of 1943.) 

A function 9 is general recursive in functions <J>i, ..., <Jh, if there is a 
system E of equations which defines <p recursively from 4 i, ..<Jq (§ 54 ). 
This includes the definition of general recursive function as the case 
1 = 0 . For l > 0 (Kleene 1943), we are usually considering a scheme or 
functional 9 = ..., <]q) (§ 47 ) which defines a number-theoretic 

function 9 of n variables from <|q> .... for any l number-theoretic 
functions <jq, ..., °f m v •. variables respectively, or any such 
functions subject to some stated restrictions. Then if the E can be given 
independently of 4i> ..(for the fixed n, l, m v ..., mf), we say that 
the scheme F is general recursive, or that 9 is general recursive uniformly in 
<Jq, ..(jq. Since our treatment will always give uniformity in the 4 ’ s 
(subject to any restrictions stated), we usually omit the word “uniformly” 
except for emphasis. (Unlike the primitive recursive case § 47 , if the 
original scheme is for some restriction on tjq, ..., tjq, it is not implied that 
the scheme can necessarily be extended to a general recursive one de¬ 
fining a 9 without restriction on the <Jq, ..., t}q.) 

Using the present terminology to restate the results of Lemmas Ha 
and lie, we now have: 

Theorem II (second version). If 9 is defined from 4 i> by 

a succession of applications of general recursive schemes, then 9 is general 
recursive in i}q, .... ijq- 

In particular, Schemata (I) — (V) are general recursive. Hence: If 
9 is primitive recursive in tjq, ..., 4i, it is general recursive in <Jq, ..., y ; . 
Any primitive recursive scheme is general recursive. If 9 is primitive re¬ 
cursive, it is general recursive. 

The definition of general recursiveness has been stated for the case 
that the function 9 is already known, by intuitive use of the same 
equations which are formalized as the E, or by some other means. This 
anticipates our purpose of showing that various functions and schemes, 
known to us independently of the formalism of recursive functions, are gen¬ 
eral recursive (as we have just done for the primitive recursive functions 
and schemes). For the case that the 9 is not previously known, we then 
have: A system E of equations defines recursively a function of n varia¬ 
bles from ipi, ..., <4;, if for each n-tuple x v ..., x n of natural numbers, 
there is exactly one numeral x such that E” E h f(x x , ..x„)=x, 
where f is the principal function letter of E, and g v ..., g £ are the given 
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function letters in order of their occurrence in the given list of function 
letters. If so, the function cp which is defined recursively by E is the 
function whose value <p(x x , .. x n ) for the natural numbers x lt ..., x n 
as arguments is the natural number x for which that x is the numeral. 

As with primitive recursiveness (§§ 45 , 47 ), the notion of general 
recursiveness for functions extends to predicates and to mixed cases by 
use of the representing functions for the predicates. 

§ 56 . Arithmetization of the formalism of recursive functions. 

The formalism of recursive functions will be treated now as a general¬ 
ized arithmetic of the sort described in § 50 . We list below recursive 
definitions for this generalized arithmetic, as in § 51 . Simultaneously, 
we indicate how to pass to the simple arithmetic by Godel numbering, 
as in § 52 . 

The generalized arithmetic shall have six zeros, with respective 
Godel numbers, as follows. 

Zeros: = ' 0 a , f 

Correlated Godel numbers: 15 21 23 25 27 29 

We permit successors to be formed of any positive number of entities, 
i.e. all natural numbers are admitted now as values of s. 

As our list f, g, h, ... of function letters we use the entities f, 

(,, (|, f)), ... (sometimes written f, f u ...), constructed from the zero 
entity f in the same manner as the (number) variables from the zero 
entity a. 

A term of the form f(r x , ..., r„) where f is a function letter and 
r lf ..., r„ are terms (n ;> 0) will be represented as the entity (f, r x , .... r„). 
So in particular the function letter f and the term f(r x , ..., r n ) for n — 0, 
although usually written alike linguistically, are not treated as the same 
entity; in the generalized arithmetic, if the former is f, the latter is (f). 
(We still have a unique entity correlated to each significant linguistic 
object, if we consider that the omission of parentheses, when f(r x , ..., r n ) 
for n = 0 is written linguistically as “f”, is merely an abbreviation.) For 
example, the equation f= 0 is the entity (—, (f), 0). 

A system of equations e 0 , ..., e s is represented as the entity (e 0 , ..., e s ). 

In the passage from the generalized arithmetic to the natural number 
arithmetic, by Godel numbering, the clauses using a variable number 
s + 1 of predecessors are handled with the help of ##E and 20 (§ 45 ). 
For these clauses (and a few other definitions), we append the cor¬ 
responding number-theoretic clause (or definition). 
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Definitions of metamathematical predicates and functions for the 

FORMALISM OF RECURSIVE FUNCTIONS AS A GENERALIZED ARITHMETIC 

Dfl. y is a numeral. (Abbreviation: 9 i(y)-) Same as Dnl § 51 . 

Df2. y is a variable. (Abbreviation: SB(y)-) Same as Dn2. 

Df 3 . y is a function letter. (Abbreviation: ^S(y)-) Like Df 2 . 

Df 4 . y is a term. (Abbreviation: 2 rrt(y)-) 

1 . y X 0 . 

2. y is a variable. 

3 . y X r' (i.e. y X (', r)), where r is a term. 

4 . y X f(r lf ...,r„) (i.e. y X (f, r v ..., r B ), cf. § 50 ), where f 
is a function letter, and r v ..., r„ are terms (n > 0). 

FL((y)o) & (i)o<i<\Uv) Tm ^y)')- 

Df 5 . z is an equation. (Abbreviation: (£q(z).) 

1 . z X r=s, where r and s are terms. 

Df6. Z is a system of equations. (Abbreviation: ®@(Z).) 

1. Z X (Zg, ..., z s ), where z 0 , ..., z s are equations. 

\h{z )>0 & {i) i<me) Eq((z)i). 

Df 7 . (t is a term, x is a variable, e is a term or equation, and) d comes 
from e by the substitution of t for x. (Abbreviation: ®b(d, e, t, x).) 
1 . t is a term, x is a variable, e X x, and d X t. 

2 — 3 . t is a term, x is a variable, e is 0 or a variable x x, and d X e. 
4 . t is a term, x is a variable, and e is a term or equation of the 
form (e 0 , e^ ..., e„) and d is (d 0 , d 1( .... d m ), where m = n, 
e 0 X i, d 0 X eg, @b(d 1( e x , t, x), ..., @b(d n , e„, t, x). 

Tm(t) & V(x) & ( Tm(e ) V Eq(e)) &lh(e)> 0 &lh(<f)=lh(tf) & 
(e) 0 y ^27 & (^)g=(^)g & (*)o<i<lh(e)^((^}<> ( e )i’ x )‘ 

Df8. (e is a term or equation, x is a variable, and) e contains x. 
(Abbreviation: (£t(e, x).) Like Dn6. 

Df 9 . c is an immediate consequence of d (by R1). (Abbreviation: ( 5 n(c, d).) 
*1. d is an equation, and there exist a (variable) y and a numeral 
n, such that d contains y, and ®6(c, d, n, y). 

Eq(d) & ( Ey) y<d {En) n<c [N{n ) & Ct(d, y) & Sb(c, d, n, y)]. 

If d is an equation and (£t(d, y), then y is a variable < d; 
and if also ®f>(c, d, n, y), then n -< c. 

DflO. c is an immediate consequence of d and e (by R2). 

(Abbreviation: (£n(c, d, e).) 

* 1 . e X h(*j, ...,z v )=z, where h is a function letter, and 



278 


GENERAL RECURSIVE FUNCTIONS 


CH. XI 


z v ..., z v , z are numerals (/> > 0); d is an equation, con¬ 
taining no variables, call it d 1 =d 2 ; and c is of the form 
d 1 =c 2 , where, for some term u containing a, 

<Sb(d 2 , u, h(z 1 , ..., z p ), a) and ©b(c 2 , u, z, a). 

Eq(e) &FL((e) h0 ) & (i) 0 <i<mieh] N((e) hi ) & N((e) 2 ) & Eq(d) & 
(y)y<d Ct(d, y) & c= 2 15 * 3 (d)l - 5 (c>2 & (Eu) u<d [Tm{u) & 

Ct(u, 25 ) & Sb((d) 2 , u, (e) v 25 ) & Sb((c) 2 , u, (e) 2 , 25 )]. 

Since a has a smaller Godel number than any function letter, 
a has a smaller Godel number than h(z 1 , ..., z P ); and hence u 
has a smaller Godel number than d. 

Df 11. x is the numeral for the natural number x. (Abbreviation: 9 iu(x, x ).) 

Same as Dnl 1. 

Dfl 2 . (Z is a system of equations, and) Y is a deduction from Z (by 

Rl and R 2 ). (Abbreviation: %{Z, Y).) 

1. Z is a system of equations (z 0 , ..., z s ), and Y X (z*) (i < s). 
SE(z)&(Ei) i<Wz) [y= 2 ^]. 

2 . Y X (c, Yj), where Y x is a deduction from Z, and c is an 
immediate consequence of {Yj},,. 

3 . Y X (c, Y 1( Y 2 ), where Yj and Y 2 are deductions from Z, and 
c is an immediate consequence of {Y x } 0 and {Y 2 } 0 . 


Df 13 . Y is a deduction from Z of an equation of the form f(x x ,... ,x„)=x, 
where f is the principal function letter of Z, x 1( ..., x n are the 
numerals for the natural numbers x v ..., x n , respectively, 
and x is a numeral (as a predicate of Z, x v ..., x n , Y, for each 
fixed n ;> 0 ). (Abbreviation: ©„(Z, x lt ..., # n ,Y).) 

1. Y is a deduction from Z, {Y} 0 X f(x 1; ..., x„)=x, where 
f is a function letter, f X {zJi.o if Z X (z 0 , ...,z s ), etc. 
D(z, y) & lh((y) 0>1 )=n' & FL((y) 0>1>0 ) & (y)o,i,o=Wihte)-i,i,o & 
^«((y)o,l,l^l) & ■ • • & Nu ((y)o,l,n> x n) & -^((y)o,2)- 
x, if y is a numeral x (i.e. if 9lu(y, x)), 
the Godel number of y, otherwise. 

Nu~ l (y) = \ix x<y Nu(y, x). 

Df 15 . U(Y) = ^({Y}^). 

Then U(Y) = x, whenever Y is a deduction of an equation of 
the form r=x where x is a numeral. 

U(y) = A r «- 1 ((y) 0>2 ). 


Df 14 . ftu 


‘(y) = { 


By the methods illustrated in § 52 , and the indications accompanying 
these definitions: 
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Lemma III. For each of the predicates and functions defined by 
Dfl — Df 15 , the corresponding number-theoretic predicate or a corresponding 
number-theoretic function is primitive recursive. 


§ 57 . The (^-operator, enumeration, diagonal procedure. We 

shall now begin using the least number operator py (§ 45 ) without a 
bound on the y. Thus for a number-theoretic predicate R(y) such that 
(Ey)R(y), pyi?(y) = {the least (natural number) y such that R{y)}. For 
the time being, we use \iyR{y) only when the existence condition (Ey)R(y) 
is fulfilled. 

Thus we have a new schema 
(Via) <?{ x v ..., x n ) = p.y[x(%, y)— 0 ] 

for the definition of a function cp of n variables (n > 0) from any function 
X of n+ 1 variables such that 

(1 a) (*i) • • • (*») (Ey) [x(*x, ..., x n , y) =0]. 

By taking R(x v ..., x n , y) 3= x(*i» • • • > y)=° if X is given first, 

or if R is given first by taking x to be the representing function of R, 
we can also write the schema thus, 

(VIb) ?(*!, — y.yR(x i, y) 

as a definition of a function 9 from a predicate R such that 

(lb) (*0 ... (x n ){Ey)R{x 1, ..y). 

Theorem III. Schema (Via) with (la) holding ((VIb) with (lb) holding) 
is general recursive. Hence by Theorem II: A function 9 defined from 
functions and predicates Y by applications of Schemata (I) — (VI) with 
(1) holding for the applications of (VI) is general recursive in Y. (Church 
1936, Kleene 1936.) 


Proof. We recast (VI) into a schema (VI') suitable for translation 
into the formalism of recursive functions, as follows: 


(VI') 


7r(#i, ..., x n , y) IT x( x i> • • • > x n> $)> 

s<y 

x(z', 0, y) = y, 

9(*x, ..., x n ) = T(n(x lt y), n(x v ...,x n , y'), y), 


(vii) 

(Vli) 


where t (u, v, y) is an auxiliary function which is left undefined for u = 0 
or v > 0 . (For another simple method, see Kleene 1943.) 

First let us convince ourselves informally that (VI') is equivalent to 
(VI). Consider any fixed values of x v ..x„, and write simply “x(y)” 
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for "x(*i> ..., x n , y)”, etc. To illustrate, suppose that for y = 0 , 1,2, ... 
(first row below), yjy) takes the values shown (second row). 

y 01234567 ... 

x(y) 3 12 0 9 0 15 ... 

t z(y) = n x (s) 1 3 3 6 0 0 0 0 

s<y 

According to (VI), <p = \x.yR{y) = py[x(>')= 0 ], i.e. <p is the least value 
of y (first row) for which a 0 appears in the second row, namely 3 in this 
example. This number 3 is also identifiable as the unique y for which a 
successor (here 6) immediately precedes a 0 in the third row. Now (VI 2 ) 
gives this number and no other as value to <p, thus. Substituting 3 for y 
in the last equation and evaluating, 

<p = t(7t(3), 7t(4), 3 ) =-• t(6, 0 , 3 ) = 3 . 


If we substitute any other number than 3 for y in the last equation, we 
are unable to evaluate t. (If the example be changed so that (Ey)R(y), 
then we get no value for cp.) 

Now let E 2 be the system of equations obtained by translating (VI 2 ) 
using say p, t, f, a v ..., a n , b for ‘V’, ‘V’, “<p”, “*i”, ..., “x n ”, “y”, 
respectively. 

The following propositions (i) — (iv) treat the situation for any fixed 
n-tuple x v ...,x n , without using (1). For (i) and (ii), 7t must be the 
function defined from x or R by (Vl(). 

(i) If (Ey)R(x v ...,x n ,y), then E£, E 2 1- f{x v ...,x n )=x when x 
= \ryR{x v ..., x n , y). (ii) If E£, E 2 b f(x v ..., x n )=x where x is 
a numeral, then (Ey)R(x v ..., x n , y) and x — (xyi?(^ x , ..x n ,y). 

The proofs closely parallel the informal explanation. 

But is primitive recursive in x (#B § 45 ), so there is a system E x 
of equations which defines n recursively from x (Theorem II § 54 ). We 
can choose E x so that p is the principal function letter, h is the given 
function letter, and the auxiliary function letters do not occur in E 2 . 
Let E be E X E 2 . Then clearly E^, E (- f(x v ..x n )=x if E£, E 2 \- 
f{x j, ..., x n )=x; and the converse follows from Lemma lie §54 as 
in the proof of Lemma lid. Using this in (i) and (ii), and combining the 
results into one statement: 

(iii) E^, E b f(x v ..., x n )=x for some numeral x, if and only if 
(Ey)R(x j,’. y), in which case x = j xyR(x v ..., x n , y). 

Now using (1) with (iii), for each «-tuple x v .. .,x„, E^, E b f(x x , ..., 
x n )=x where x is a numeral, if and only if % = [iyR(x 1 , ...,x n ,y); 
i.e. E defines the function pi yR(x v ...,x n ,y) recursively from R. 
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Proof of Theorem IV. Now suppose that R{x lt ..x„,y) is general 
recursive. Let D be a system of equations which defines recursively the 
representing function x of R, with h as principal function letter, and with 
auxiliary function letters not occurring in E. Let F be DE. Now: 

(iv) F |- f(x lt ..x n )=x for some numeral x, if and only if 
(Ey)R(x v ..x n , y), in which case x — \xyR{x v .. x„, y). 

Using Dfl3 § 56, we state this result (omitting the final remark) in 
symbolic form, thus: 

( 2 ) (Ey)R(x v y) = (£Y)@„(F, * lf ..Y). 

Now let / be the Godel number of F. By the definition of ‘the cor¬ 
responding number-theoretic predicate’ § 52, S n (f, x v ..., x„, y) = {y is 
the Godel number of an entity Y such that <S n (F, x v Y)}. Hence 

(£Y)@„(F, x lt Y) = (Ey)S n (f, x v ...,x„, y). So ( 2 ) gives 

(3) (jBy)2?(*j, ...,x n ,y) m (. Ey)S n {f , x lt .... x„, y). 

By Lemma III, S n is primitive recursive. 

In stating the result in the theorem, we replace S n by the predicate 
r„ defined from it thus, 

Xy, . . «, X n , y) = S n (z, Xy, • • *i X ni y) &. Xy, * • •> %n> 0* 

For given z, x v .... x n , T n (z, x x , ..x n , y) is true for at most one y 
(cf. *174 §41). The advantage of using T n instead of S n as the basic 
predicate of the theory will appear in § 58. By # #D and E, T n is also 
primitive recursive. By the informal counterparts of &-elim., *70 § 32 
and *149a §40 (noting that, since S n is recursive, 

S n (z, Xy, x n , y) V S„(z, Xy, x n , y) 
intuitionistically, as will be discussed in §§ 60 and 62), 

(4) F n (£, Xy, • • •> x ni y) > S n (z, Xy , *. *, x nt y), 

(5) (. Ey)T n (z , x v .... y) = (Ey)S n {z, x n ,y). 

Theorem IV. For each n > 0: Given any general recursive predicate 
R(xy, ..., x n , y), numbers f and g can be found such that 

( 6 ) (Ey)R{xy, y) * (Ey)T „(/, x n , y), 

(7) {y)R{x v ...,x n ,y) = {y)T n (g, Xy, ...,x n , y), 

where T n (z, x x , .... x n , y) is the particular primitive recursive predicate 
defined above. Similarly with more quantifiers ; e.g. given a general recursive 
predicate R{a lt ..., a n , x, y), there is a number g such that 

( 8 ) (Ex)(y)R{a lt x, y) = {Ex){y)T n+1 (g, a v .... a n , x, y). 

(Enumeration theorem, Kleene 1943 .) 
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Proof (completed). Formula ( 6 ) follows from (3) by (5). To infer 
(7), we apply ( 6 ) to the predicate R(x 1 , . .x n , y), calling the / for this 
predicate “g”, and transform the result by the informal counterparts 
of *30 § 26 and *86 § 35 to 

{y)R{x v ...,x n , y) = (y)f n (g, x v ..., x„, y). 

Since R is general recursive, we have R V R, and hence (cf. *49c § 27) 
R = R. For ( 8 ), we apply (7) with n +1 as the n of (7), a lt ..., a ni x 
as the x v ..., x„, and y as the y, and use the informal counterpart of 
*72 §33. 

Discussion. By this theorem, we obtain an enumeration (with 
repetitions) of the ^-variable predicates of the form ( Ey)R(x 1 , ..x n ,y) 
where R is general recursive by taking z = 0, 1 ,2, ... in the fixed «+ 1 - 
variable predicate (Ey)T n (z, x v '..., x n , y) of like form. Briefly, 
(Ey)T n (z, x v ..., x n , y) ‘enumerates’ the predicates of the form 
(Ey)R(x 1 , ..., x n , y) with a general recursive R. Similarly, 

( y)T n (z , x v ...,x n ,y) enumerates the predicates of the form 
( y)R{x v .. ., x n , y) with a general recursive R \ etc.—Since T n , T n , T n+1 , 
etc. are primitive recursive: 

Corollary. The class of the predicates expressible in a given form 
consisting of a fixed succession of one or more quantifiers prefixed to a 
predicate R is the same whether a general recursive R or a primitive recursive 
R be allowed. (Kleene 1943 , generalizing a lemma of Rosser 1936 p. 87.) 

We now use the above enumerations as the basis for an application 
of Cantor’s diagonal method to prove the next theorem. Given a general 
recursive predicate R(x, y), and using ( 6 ) for n — 1 , there is a number 
/ such that 

(9) (Ey)R(x,y)^(Ey)T 1 (f,x,y). 

Substituting / for x in this equivalence, 

(10) (Ey)R(f,y)^(Ey)T 1 (f,f,y). 

Thence, using the informal counterpart of *50a § 27, 

(11) (Ey)R(f,y)^{Ty)T 1 {f,f,y). 

Formula ( 12 ) of the theorem follows, using the informal counterpart of 
*86 § 35. To prove (13), using (7) we have 

(y)#(g. y) = (y)Ti(g, g, y) = {Ey)T x {g, g, y) & {Ey^^g, g, y). 

To infer (14) and (15), given any general recursive R(x), let R{x,y) = 
R(x)&y=y or R(x, y) = R(U\(x, y)) (§44), so that R{x,y) is general 
recursive and R(x) = ( Ey)R(x , y) = (y)R(x,y). 
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Theorem V (Part I). Given any general recursive predicate R(x y y) t 
numbers f and g can be found such that 

( 12 ) (Ey)R(f,y)*(y)T 1 (f,f,y), 

(13) (y)R{g, y) * {Ry)T l {g, g, y). 

A fortiori , given any general recursive predicate R(x), numbers f and g 
can be found such that 

(14) RM^MUf.f.y), 

( 15 ) R(g) & {Ey)T x {g, g, y). 

Discussion. By this theorem, ( y)T 1 (x , x, y) is an example of a pred¬ 
icate of the form (y)i?(#, y) with a recursive R which is not expressible 
in the dual form ( Ey)R(x , y) with a recursive R. That is, 

(x)[(y)T 1 (x, x, y) s ( Ey)R{x, y)] 

cannot hold for any general recursive predicate R(x, y ); and for a given 
R(x,y), the / of (1 2 ) is a value of x which refutes it. A fortiori, (y) T 1 (x, x, y) 
is also not general recursive (cf. ( 14 )). 

The proof (above) amounts simply to this: (Ey)T x {z, x, y) for z — 0 , 
1, 2, ... is an enumeration (with repetitions) of all predicates of the form 
(Ey)R(x, y) with R recursive. By Cantor’s diagonal method, (EyjT^x, x, y) 
is a predicate not in the enumeration. The latter is equivalent to 
(y)Ti(x, x, y). 

From the enumerability of all systems E of equations, we can conclude 
without the present theory that the general recursive predicates are 
enumerable, and so also the predicates of the form (Ey)R(x, y) with R 
recursive. Hence by Cantor’s results (§ 2 ), they cannot constitute all 
number-theoretic predicates. The additional content of the present 
theorem is that an example of a predicate neither general recursive nor 
expressible in the form (Ey)R(x, y) with R recursive is given which is of 
the dual form (y)R(x, y) with R recursive; and vice versa. 

To save space, we write the next part of the theorem for predicates of 
one variable a ; but it holds likewise for predicates of n variables a x , ..., a n 
for each n ;> 1 . In the proof of (b) we use a classical equivalence, not 
available to us intuitionistically. We accordingly label (b) with a “ c ” 
(cf. § 37 ). The results so labeled in this chapter are all on the number- 
theoretic level. 

Theorem V (Part II). Consider the predicate forms 
Rid] ( Ex ) R ( a ’ x ) {x){Ey)R{a, x,y) {Ex){y){Ez)R(a, x, y, z) ... 

U (x)R(a,x) (Ex)(y)R(a, x, y) (x)(Ey)(z)R(a, x, y, z) ... 



284 


GENERAL RECURSIVE FUNCTIONS 


CH. XI 


where the R for each is general recursive. To each form with k -j -1 quantifiers 
(k > 0 ), there is a predicate expressible (a) in the negation of the form 
((b) c in the other k + 1 -quantifier form), but not in the form itself nor in 
any of the forms with <; k quantifiers. By Part I, the “ c ” is unnecessary 
for= 0. (Kleene 1943 ; Mostowski 1947 .) 

Proof. For example, take k = 1 . From ( 8 ) (for n — 1 ) we infer 
(16) (Ex){y)R{g, x, y) ^ {E~x){y)f 2 {g, g, x, y), 

just as from ( 6 ) we inferred (11). Thus (Ex)(y)T 2 (a, a, x, y) (equivalent 
classically to ( x)(Ey)T 2 (a, a, x, y), cf. Theorem 18 § 35) is not expressible 
in the form (Ex){y)R(a, x, y). Similarly, ( x)(Ey)T 2 (a, a, x, y) (equivalent 
classically to ( Ex)(y)T 2 (a , a, x, y)) is not expressible in the form 
(x)(Ey)R(a, x,y). A fortiori, these predicates are not expressible in any 
of the three forms with one or no quantifier. 

Theorem VI. For each n ^ 0: (a) Every general recursive predicate 
P(x v ..., x n ) is expressible in both of the forms ( Ey)R(x v ..., x n ,y) 
and (y)S(x v ..., x„, y) where R and S are primitive recursive. (b) c Con¬ 
versely, every predicate expressible in both of these forms where R and S are 
general recursive is general recursive, (c) A predicate P(x v ..., x n ) is 
general recursive, if and only if P(x lt ..., x n ) and P(x v ...,*„) are each 
expressible in the form ( Ey)R(x lt ..., x„, y) with a general recursive R, and 
(*i) • • • (*«)[P(*i, • •*„) V P(x v .. .,*„)]. (Kleene 1943 , Post 1944 , 
Mostowski 1947 .) 

Proofs, (a) Take P{x v ..., x n , y) = P(x x , .. x n ) & y=y, so that 
P(x v ...,x„) = (Ey)P(x v ...,x n ,y) = (y)P(x 1 , y), and apply 

Corollary Theorem IV. (b) Suppose P[x lt ..., x n ) = ( Ey)R(x 1 , ..., x n , y) 
= {y)S(x v y). Then by classical logic (cf. *85 § 35), P(x v ..., x n ) 

= ( Ey)S(x 1 , ... ,x n , y). By the classical law of the excluded middle, 
P(x v ..., x n ) V P(x v ...,x n ). Hence 
P(*!, ...,*„) = R(x v ...,x n , { iy[R(x v ...,x n ,y)V S(* 1( ..., y)]), 

where the second member is general recursive by Theorem III. (c) The 
last hypothesis makes the statement valid intuitionistically. 

In Kleene 1943 , 1944 , a predicate was called "elementary”, if it can 
be expressed explicitly in terms of constant and variable natural numbers, 
general recursive predicates, the operators ->,«&, V, — of the proposi¬ 
tional calculus, and the quantifiers, combined according to the usual 
syntactical rules. 
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Theorem VII. (a) Every arithmetical predicate (§ 48) is “elementary”, 
and (b) conversely, (c) Every predicate expressible by means of quantifiers 
prefixed to a general recursive predicate is expressible in one of the forms of 
Theorem V with a general recursive R. (d) c Every arithmetical predicate 
is expressible in one of the forms of Theorem V with a general recursive R. 

Proofs, (a) By ## 1 , 2 , 14, C (§§44,45), every predicate formed 
explicitly using +, •, = is primitive recursive, and hence general re¬ 
cursive. (b) By Theorem VI (a) and Corollary Theorem I § 49. (c) As x 
ranges over all natural numbers, the m+\ -tuple of primitive recursive 
functions (x) 0 , ...,(x) m (#19 §45) ranges (with repetitions) over all 
m+ 1 -tuples of natural numbers. Therefore 

(17) {Ex 0 ) ... {Ex m )A(x 0 , ...,x m ) = (Ex)A({x) 0 , ..., ( x ) m ), 

(18) (% 0 ) • • • {x m )A(x o, ..., x m )= [x)A{{x) 0 , .... (x) n ). 

We use these to contract consecutive quantifiers of the same kind, 
(d) By (a), the informal counterpart of Theorem 19 § 35, and (c). 

Remark 1 . For any predicate A (i, x ), 

(19) (i) t< SEx)A(i.x) = (Ex)(i) l<a A[i, (*),). 

By substituting A {i, x) for A (i, x) and using (the informal counterparts 
of) *30, *85, * 86 , *58 and *49 (cf. the proof of Corollary Theorems 8 
and 18), we infer classically the dual 

(20) c {Ei) i<a {x)A{i, x) s [x){Ei) i<a A(i, (*),). 

(But (20) does not hold in general intuitionistically, by Example 4 § 82.) — 
Simply by *95 and *77, {i) i<a {x)A{i, x) = [x){i) i<a A{i, x) \ and similarly 
f o r {Ei) i<a {Ex), {i)i <a {x) x<b and (Ei) i<a (Ex) x<b . 

Although there is a substantial difference between the notions 
‘arithmetical’ and ‘elementary’, which requires two basic theorems 
to bridge (Theorems I and IV), henceforth in the interest of unifying 
terminology we shall usually say “arithmetical” (even when primarily 
we have the other notion in mind). 

Example 1 . “Elementary” is used by Kalmar 1943 * in another 
sense, equivalent to the following. A function is “elementary”, if it can 
be expressed explicitly in terms of variable natural numbers, the constant 
1 , the functions +, • and [a/b], and the operations 2 and II. Extending 

y<z y<z 

the notion to predicates, and to the case of assumed functions and 
predicates, in the familiar way, we can recognize successively that the 
following of our list ## 1—21 of primitive recursive functions and predi¬ 
cates (§§ 44,45) are elementary, and the following of our results # #A—G 



286 


GENERAL RECURSIVE FUNCTIONS 


CH. XI 


hold reading "elementary” in place of "primitive recursive”: # 1 , # 2 , 
#13, ##A—C, #3, #4, #9 (sg'(«)= [l/(a+l)]), # 10 , #D, #E 
(II is II ), #15 (the representing function is sg[(«+ l)/(i+1)]), #6 

8<i S<t +1 _ 

(a-b = [LC cSa [b+c<a V a<b]), #5, #7, # 8 , #11, #12 (rm (a,b) 
= a—b[a/b]) t #14, #F, #16, #17, #19 as a function of a for each 
fixed i\ also Godel’s [3-function (§48). — Next we show that (A) if 9 
comes from x {f rom x) by a primitive recursion (Schema (V)), and 
9 (y, x 2 , ..., %„) < h]{y, x 2 , ..., x n ), then 9 is elementary in ip, x, ?) (in x, rj). 
In the proof of Theorem I § 49 Case (Vb), (B) is of the form 
(Ec)(Ed)R(y, x 2 , ..x n , w, c, d), where R is elementary in and x, and 
(c)(d)[R(y, x 2 , ...,x n ,w, c, d) 9 (y, x 2 , ..x n ) = w]. Hence 
9 (y, x 2 , x n ) = ((. UR(y , x 2 , ...,x n , ( t ) 0 , (*) 1( (t) 2 )) 0 . So by #E it will 
suffice to find a bound elementary in vj on the t. But by § 48, we can 
choose d = (max(y, a 0 , ..a y ))\ where a t — 9 (i, x 2 , .. .,x n ), so that 
d <, D where D — (y+ S r\(i, x 2 , .. .,x n ))\, and c < II 8(d, i) — 

n (l + (* + l)rf) ^ C where C = II (1 + (1 + 1 )D). So we can take 
iSy i<v 

t < 2 e -2> c -5 d where E — yj(y, x 2 , ..x n ). — Now^ (#18) is elementary, 
for an elementary bound can be found, e.g. p t < 2 2i (= only for i = 0 ), 
as we prove by course-of-values induction on i, thus. True for i — 0, 1 . 
For i ;> 2 , pi <. po'Pi'• • •' Pi -1 — 1 (by reasoning as in the proof 
of Euclid’s theorem §40) < 2 2 °-2 21 2 2<_1 (using the hyp. ind.) 

— 2 2 °+ 2l+ -+ 2l_1 = 2 2 *-i < 2 2 \ — Then the following are elementary: 
#19, #20 (lh(f, a) < a), # 21 . Also (cf. #G) (B) if 9 comes from 
X by the course-of-values recursion (3) §46, and 9 ( 9 , x 2 , ..., x n ) < 
y)(y, x 2 , .... x n ), then 9 is elementary in x, t). For then II p\l {i,X2 Xn) is 

^ } 4< y 

a bound elementary in yj for 9 ( 9 ; x 2 , ..., x n ) in the primitive recursion 
(4) § 46. — It follows that the number-theoretic predicates and functions 
corresponding to the metamathematical ones defined by Dnl—Dnl3a 
§ 51 (cf. Lemma 19 § 52) and Dfl—Dfl5 § 56 (cf. Lemma III) are elemen¬ 
tary. For recursions are used in those metamathematical definitions in 
the generalized arithmetic only to introduce (the representing functions 
of) predicates, and so (B) applies to each corresponding recursion in the 
simple arithmetic with tj = 1 . In particular, S n (Dfl3), and hence (C) 
T n (preceding Theorem IV), and U (Df 15) are elementary. This corresponds 
to the result of Ilona Bereczki ( 1949 * unpublished) that (by means of 
devices used in Kalmar 1943 ) each of the predicates and functions 
shown in Kleene 1936 to be primitive recursive is either elementary or 
can be replaced by one which is elementary without deranging the 
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arguments of that paper. — So (D) we can add “or an elementary R 
{in Kalmar’s sense)” in Corollary Theorem IV. — Kalmar ( 1943 , 1948 , 
1950 , 1950 a) uses his elementary functions in presenting Godel’s theorem 
and other results which are presented in this book using primitive re¬ 
cursive functions. — Miss Bereczki ( 1952 *) shows that the primitive re¬ 
cursive function h a defined by °a — 1 , b 'a = a< 6a > (so that b+1 a = 
<” 3 (b, a) § 55) is not elementary because “a grows too fast as a increases. 
She obtains another example of a non-elementary primitive recursive 
function by constructing a primitive recursive enumerating function 
<p(n, a) for the elementary functions of one variable (cf. § 55). From this 
it follows that there are non-elementary primitive recursive predicates 
(similarly to Example 1 § 55). 

Theorem VIII c . For the primitive recursive V and u defined below, 
the predicate M(a, k) defined by induction on k thus, 

M{a, 0 ) ^ V[a), 

■ M{a, 2k-\-\) ss {Ex)M{x>{a, x), 2k), 

M{a, 2k-\-2) = {x)M(u{a, x), 2k-{-\), 
is not arithmetical. (Kleene 1943 .) 

Kalmdr first obtained a result of this sort, which appears in Skolem 
1936-7 pp. 86 ff. Also cf. Wang 1953 , Mostowski 1951 . 

Proof (optional). LetusintheexpressionforS „(£,*!, .. x n ,y) (Dfl3 
§56) replace *V” by “(x)'”, and “Nu((y) 0 hl>Xl ) &...& Nu{{y) 0>hn ,x n )” 
by "(*)i<* <( X ) 0 Ku{{y) o li , (*)f— 1)”. The result is a primitive recursive 
predicate S{z,x,y) such that 

(21) S{z, 2 n ■ pfpl» ■ pi, y) = S n (z, x lt ...,x n ,y). 

Let T(z, x, y) = S(z, x, y) & (t) i<v S(z, x, t). Then 

(22) T(z, 2 n ■ />?'•... • p*»-pi, y) = T n (z, x lt ..., x n , y). 

Now let F(a) = r((«) x — 1 ,«, («) (a)o ,— 1 ). Then for n>\, 

(23) V(2 n ■ pf-...■ p*/-p^ ^ T n (x v x v ..., x n , y). 

Let u {a, x) = a*2 x ' (cf. #21 § 45). Now M{a, k) takes for k = 0, 1,2, ... 

the values V{a), ( Ex)V{a*2 x '), {x){Ey)V{{a*2 x ')*2 v ‘), -Suppose M{a, k) 

were arithmetical. By Theorem VII (d), then it would be expressible in 
one of the forms of Theorem V. For illustration, suppose M{a, k) = 
(Ex)R{a, k, x) fora recursive R. Then {Ex)R{ 2 2 -3 a ', 2, x) = M{ 2 2 -3 a ', 2) = 
[x) ( Ey) F((( 2 2 • 3 a )*r)*2 v ) = (x) {Ey) V{2 2 ■ 3“'- 5 31 '- 7 V ) s (x) (Ey) T 2 (a, a, x, y). 
But (Ex)R(2 2 -3 a ', 2, x) is of the form (Ex)R{a, x) with R recursive; and 
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in the proof of Theorem V Part II we saw that (x)(Ey)T 2 (a, a, x, y) is 
not expressible in this form. 

The reader may pass if he wishes to §§ 60—62, which do not depend 
except incidentally on §§ 58 and 59. But § 58 contains a result which 
is fundamental for §§ 63—66 and later. 

§ 58. Normal form, Post’s theorem. Using Dfl3 and Dfl5 (§ 56), 
we can restate the definition of ‘general recursive function’ (§§ 55, 54) 
as follows. A function ?(x v ...,*„) is general recursive, if and only 
if there exists a system E of equations (without given function letters) 
such that 

(24) (x,) ... (* n )(EY)© n (E, * x , ..., * n , Y), 

(25) ( Xl ) ... (*„)(Y)[@„(E, Y) -* tt(Y)=?(#!, . ..,*«)]. 

On passing over from the generalized arithmetic to the simple arithme¬ 
tic by the Godel numbering, <B n becomes S n , tt becomes U, E becomes its 
Godel number e, and (24) and (25) give 

(26) (^i) ■ • • {x*i)(Ey)S nt (c, Xy, ..., x n> y), 

(27) (xy) ... (x n )(y)[S n (e, Xi, • ■x n , y) —*• U(y)—<p(Xy, ..., x n )]. 

By Lemma III, U as well as S n is primitive recursive. 

It follows from (26) and (27) that the function 9 (x x , ..., x n ) can be 
expressed in terms of the number e thus, 

(28) <p(x v ...,x n ) = U([LyS n (e, x v y)). 

By ( 5 ) and ( 4 ) § 57 , now (26) and (27) and therefore also (28) remain 
valid when S n is replaced by T n . 

Theorem IX. For each n ;> 0 : Given any general recursive function 
9 (x x , ..., x n ), a number e can be found such that 

(29) (*j) ... (x„)(Ey)T n (e, %,..., x n , y), 

(30) • • •, x n ) = U ([ry 2 n (e, x-y, ..., x n , y)), 

(31) (x x ) ... {x n )(y)\T n (e, x-y, ■ ■ ■ > x„, y ) —>■ U(y)—<f>{Xy, ...,x n )], 

where T n (z, x v .... x n , y) and U(y) are the particular primitive recursive 
predicate and function defined above. (Normal form theorem, Kleene 1936 , 
I 943-) 

The advantage of using T n instead of S n is that (31) holds for any 
number e such that (29/ and (30) hold (whereas (26) and (28) might hold 
and (27) be false, when e is the Godel number of a system E of equations 
which lacks the consistency property for defining 9 recursively). 
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We now say, for any general recursive function 9, that any number 
e (whether or not it is the Godel number of a system E of equations 
defining 9 recursively) such that ( 29 ) and ( 30 ) (and hence ( 31 )) hold 
defines 9 recursively or is a Godel number of 9. 

A number e defines recursively (or is a Godel mimber of) a general re¬ 
cursive predicate P(x v ...,#„), if it defines recursively the representing 
function of P. In this case, 

(32) p ( Xv •••>*")- »(*» *1.*«• y) & v(y )=°] 

1 . y) -* u(y)=0]. 

To prove constructively that a function 9 is general recursive, one 
must exhibit (or imply a method for obtaining) equations E which define 
9 recursively. Thus to give a general recursive function effectively means 
to give an E, or now a Godel number e. 

The theory of the Godel numbers of recursive functions will be treated 
in the next chapter (§ 65). 

Example 1 . Is every general recursive function y(x lt ...,%„) ex¬ 
pressible in the form \LyR{x v y) where R is primitive recursive 

(and (lb) hplds)? Hint: Use Example 1 §55 and #E §45. (A different 
method was used by Post 1946 a.) — Call a function 0(y) “universal”, 
if for each general recursive function (f>(x lt ...,*„) there exists a primitive 
recursive predicate R(x v ..., x n , y) with (lb) holding such that 
9 (x 1( ...,*„) = Q([LyR(x v y )); and call 0(y) “of large oscillation”, 

if {x)(z){Ey) y> £(y)—x. Markov 1947 c, 1949 shows that a sufficient 
and classically necessary condition that a primitive recursive function 0 
be universal is that it be of large oscillation. Kuznecov 1950 announces 
further results concerning such functions 0 . 

Corollary. Every general recursive function 9 is definable by ap¬ 
plications of Schemata (I) — (VI) with ( 1 ) holding for the applications of 
(VI). (Converse of Theorem III, for Y empty.) 

For simplicity, beginning with Theorem IV, we confined our attention 
to functions and predicates general recursive absolutely, i.e. recursive 
in ..., for 1 = 0. Now we shall extend the theory tc relative general 
recursiveness, i.e. to / > 0 . 

We shall find that Theorem IX and the earlier results hold, if instead 
of a primitive recursive predicate T n , we use one primitive recursive 
in tjq, .... Let us work first with the case that l = 1 and ^ (= <X) 
a function of one variable (m x = 1 ). 

By definition (§ 55), 9 is general recursive in <j;, if there is a system E 
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of equations which defines cp recursively from ^ (§ 54). If there is any 
such E, then we can choose one in which the given function letter g 
(with Godel number g) is the first in the fixed enumeration of function 
letters. 

Now we state appropriate modifications of Dfl2 and Dfl3 §56. 

Dfl 2 * (for l = m 1 = 1 ). (Z is a system of equations, and) Y is a de¬ 
duction from E^, Z (by Rl and R2). (Abbreviation: Y).) 

*0. Z is a system of equations; and for some natural number 
u v Y is (g (u 1 ) = u) where u = ^(tq). 

SE(z) & y = 2 exp 2 15 • (3 exp 2°- 3 <y )°,i,i) . 5 M#.* & 

(£*h) Ml <y[W«((y) 0 ,i,i>%) & ^«((y) 0 , 2 . ^K))l- 
1 —3. Like 1 —3Dfl2, except reading "from E^, Z” in place of 
"from Z”. 

Dfl3* (for l = m 1 = 1 ). Like Dfl3, reading "from E^, Z” in place of 
"from Z”, and “D'\z, y)” in place of " D(z,y)”. 

(Abbreviation: ©^(Z, x v ..., x n , Y).) 

The reasoning used before to show that D(z, y) and S n (z, x v ..x„, y) 
are primitive recursive (Lemma III § 56) shows now that D^(z, y) and 
S^(z, x v .... x n , y) are primitive recursive in <j;; and in place of (26) and 
(27) for a general recursive function <p(tf 1( .. x„), we now have for a 
function <p(z 1( ..., x„) general recursive in 

(33) (*i) • •. ( x n )(Ey)S%(e, x lt x ni y), 

(34) (xj ... (x n )(y)[S%(e, x v y) -► £7(y) = <p(#i, . .*„)j. 

We could continue as before; but we shall show also that the depend¬ 
ence on ^ can be given a special form. 

The course-of-values function <j)(y) (= II pf (i \ §46 ( 1 )) for (j/ is prim- 

%<v 

itive recursive in ^ (##A, B, 3, 18, §§ 44, 45). 

Using § 46 ( 2 ), <^(%) in ODf 12* can be written as (<f(y)) Ul , since iq < y, 
or even as ($(u))„ for any v > y. Let D l (w, z, y) be the predicate obtained 
in place of D^(z, y) when we replace (J'(wi) in 0Dfl2* by {w) u . Then D 1 
is primitive recursive; and by course-of-values induction on y (with cases 
corresponding to the four clauses in the recursive definitions of D 1 and 
D'K cl § 52), 

(35) . (v) v > y m%(v),z,y) = D±( Z ,y)]. 

Using D 1 (w,z,y) in place of D^(z,y) in Dfl3*, we obtain a primitive 
recursive predicate S* (w, z, x v .... x n , y) such that 

( 36 ) i v )v> v [ s i(M v )> x i> •••.*», y) = s %( z > x i> • • •> *»> y)]- 
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We define T\ and T% thus, 

T l n (w, z,x v .. .,x n ,y) = S\(w,z > x v ...,x n ,y) & (Z) i<2/ S*K z, x v . ..,*„,/), 
T t( z > x v .... x n , y) s T l n ${y), z,x lt .... x n , y). 

Then T\ is primitive recursive, and T\ is primitive recursive in 4- Using 
(36) first with y, y and then with y, t as the v, y, 


(37) Tl(z, x v .... x n , y) = S£(z, x v x n , y) & {t) t<y S'i(z, x v ..., x n , (). 


Now we can continue from (33) and (34) with (37), as we did before 
from (26) and (27) with the definition of T n (preceding (4) § 57). 

Our statements here are to be understood intuitionistically as con¬ 
sequences of the hypothesis that particular values of 4 are available on 
demand. This hypothesis justifies e.g. expressing a given value v( ; h) as 
($( v ))u 1 for a given v > u v which requires the other values among 
4(0), . .., 4(^—1); and by it 

x v ...,x n ,t)\/ sf t (z, x lt x,„ /)] 


for a given y, which requires the values 4(0), .... 4(y—1) (cf. the proof 

of (5)). 

The results obtained in § 57 beginning with Theorem IV extend 
similarly. " 

The case of any l functions of m v ..m t variables, respectively, can 
be reduced to the case l — (by taking 4 *{x) = 4i((*)i> • • •> 

4 = ... • Pp ); but is also easily treated directly. For example, if 

l = tn 1 = 2, m 2 = 0, we define the course-of-values function 4i f° r 4i 
(in both variables) thus, 4i(y> z ) — n pi exp (II p } exp 4i(b j)), so that 

„ ! <i/ ><- ^ 

4 x (s, t) = (4x(y» ^)) s ,t if s<y & t<z. We define 42 = 42- Dfl2* and 
Dfl3* are formulated for / = m x = 2, m % — 0; and we introduce succes¬ 
sively predicates D 2,0 , S 2,0 , T 2,0 , and let 


Tp x„..., x n , y) = T^iy, y), 4 2 , z, ^..v,„ y). 


In general, we come out with a primitive recursive predicate T"' 1 m i 

of variables, and a predicate Tp ^ of «-j-2 variables 

primitive recursive in 4i, • • •, 4t- 

When T is a list of l functions and predicates, the foregoing applies by 
taking as 4i» • • ■» 4; the list obtained from T by replacing the predicates 
(if any) by their representing functions (cf. end § 55); and we then write 
Tp .also as ?*\ 

The definition of ‘arithmetical predicate’ (§ 48) is adapted to Z > 0 by 
adding the functions 4i. . .4( to the initial functions + and •; that of 
"elementary predicate’’ (§ 57 preceding Theorem VII) by substituting 



292 


GENERAL RECURSIVE FUNCTIONS 


CH. XI 


"general recursive in Y” for "general recursive”. (‘Uniformity’ can be 
defined in the usual manner, and the resulting relationships in our 
theorems are uniform when the given ones are uniform.) 


Theorem X. Let l, m x , ...,m x be fixed numbers ;> 0 , and Y be l 
functions and predicates of m v ..., m t variables, respectively. Then Theo¬ 
rems I, IV — IX and corollaries hold good reading “general recursive in Y”, 
“primitive recursive in Y”, “arithmetical in Y”, “elementary in Y”, 
“Tj’\ “V' ¥ ", in place of “general recursive”, “primitive recur¬ 

sive”, “arithmetical”, “elementary”, “T n ”, “V”, “M”, respectively (i with 
U and u unchanged ). 

The theorems I, IV — IX thus extended we cite using stars. For ex¬ 
ample: Theorem IX*. For each n>. 0 : Given any function <p(%, ..., x n ) 
general recursive in Y, a number e can be found such that 


(38) (x x ) ... (x n ){Ey)Tj{e, x v .. ., % n , y), 

(39) <p(*i> ...,x n ) = U (fxyrj (e, x v ..., x„, y)), 

(40) ( Xl ) ... (x n ){y)[Tj(e, x v ...,x n , y) - U(y)~ 9 (x v .. .,x n )], 

where U(y) is the primitive recursive function, and T^(z, x lt ..., x n , y) 

the predicate primitive recursive in Y, defined above’, e.g. for l = m 1 

= 1, Tj(z, x v ..., x n , y) = P* ($(y), z, x v .... x„, y) where 

T\(w, z, x v ..., x n , y) is the particular primitive recursive predicate defined 

above. 


A number e such that (38) and (39) (and hence (40)) hold is said to 
define 9 recursively in (or from) Y or to be a Godel number of 9 from Y 
or of the functional 9 = F(Y). The notion extends to a predicate P with 
representing function 9 as before. 

Corollary, (a) Every predicate P general recursive in arithmetical 
Predicates Y is arithmetical. ( Similarly, if P is general recursive in Y, 0, 
and Y are arithmetical, then P is arithmetical in 0.) (b) c The predicate 
M(a, k) of Theorem VIII is not general recursive in any arithmetical 
predicates. 

Proof of (a). By Theorem VI* (a), P is expressible by a quantifier 
prefixed to a predicate R primitive recursive in Y. But by Corollary 

Theorem I*, R is then arithmetical in Y; so since Y are arithmetical, 
R and hence P is arithmetical. See Note 1 on p. 316. 

Under the definition of “elementary predicate” (preceding Theorem 
VII § 57), the general recursions were applied only before the logical 
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operations. The meaning of Corollary (a) is that permitting general re¬ 
cursive operations at all stages (i.e. interspersed with the logical operations 
in any manner) does not lead to a larger class of predicates. 

It is natural to inquire just what happens to predicates falling in a 
given place in the scale of Theorem V Part II (cf. Theorem VII (d)) when 
general recursive schemes are applied to them. 

The distinction (for k > 0 ) between the two ^-quantifier forms is 
lost when primitive recursive operations are applied. For example, 
although (Ex)7.\(a, a, x) is not expressible in the other one-quantifier 
form {x)R{a, x), its negation (Ex)7\(a, a, x) is primitive recursive in 
it (#D §45), and assumes that other form (cf. *86 §35). For any 
predicate P(a), the representing predicate P(a, w) (§ 41) of the represent¬ 
ing function (§ 45) of F(a) can be expressed thus, 

(41) P(a, w) ss {P(a) & w= 0} V { P(a) & w— 1). 

This is primitive recursive in P(a). When P(a) - (. Ex)T x (a ., a, x), P(a, w) 
is expressible in neither of the one-quantifier forms: e.g. if P(a, w) = 
(. x)R(a , w, x), then (x)R(a, 0, x) = P(a, 0) = P(a) = (Ex^^a, a, x), which 
contradicts Theorem V if R is recursive. 

Mostowski 1948 a gives (classically) an example of a predicate, which 
(by (b) of the next theorem) is general recursive in predicates of the 
1 -quantifier forms, but which cannot be expressed in terms of predicates 
of the 1 -quantifier forms by the operations of the propositional calculus. 

The following theorem and corollary are Post’s on the basis of an 
abstract ( 1948 ), as the author became aware after working out the 
present treatment (in 1949), 

Theorem XI C (a) If a predicate P is general recursive in predicates 
Q v =..,(?! of the k-quantifier forms of Theorem V, then P is expressible in 
both the, k +1 quantifier forms, and (b) conversely. 

The proof of Theorem VI (b) establishes the present (b), when reread 
taking the R and S now to be of appropriate ^-quantifier forms. The 
proof of (a) will be completed following the lemmas. 

Lemma IV C . The representing predicate of the representing function of 
a predicate of either k-quantifier form is expressible in the k -f-1 -quantifier 
form with existence first. 

Proof of Lemma I V. For example, suppose P(a) == ( x)(Ey)R(a, x, y) 
with recursive R. Then 
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P(a,w) = {( x)(Ey)R(a,x,y) & w= 0} V {( x)(Ey)R(a,x,y ) & w = \) (by (41)) 
= -{{x 1 ){Ey 1 )R(a, x v yj & w= 0 } V {{Ex 2 ){y 2 )R{a, x 2 ,y 2 ) &w= 1} 

(cf. *85, * 86 ) 

= (£A 2 )(^ 1 )(y 2 )(£y 1 )[{i?(«, x v y t ) & w=0} V (R(a, % 2 , y 2 ) & w= 1}] 

(cf. *89—*92) 

= (Ex){y)(Ez)[{R(a, {y) 0 ,z)&w= 0} V {i?(a, *, (y) x ) & w = 1}] (by (18)). 

The last expression is of the form (Ex)(y)(Ez)R{a,w, x,y, z) with re¬ 
cursive R. (This and the next lemma could also be proved for generality 
first.) 

Lemma V. Let p{a lt ...,a m ) be the course-of-values function for 
<{/(«!, ..., a m ). If the representing predicate of ^{a^ .... a m ) is expressible 
in the k -f -1 -quantifier form with existence first, so is the representing 
predicate of $(«, a). 

Proof of Lemma V. For example, take m = k — 1 . By hypothesis, 
ty(a)=w = (Ex)(y)Q(a, w, x, y) with recursive Q. Now 

$(«)=«' = w = n & {i) i<a ty{i)=(w) i 

i<a 

= W = n p^i & (i) i<a {Ex)(y)Q(i, {w) it x, y) 

i<a 

= W = n p^i & (Ex)(y){i) i<a Q(i, (w) { , {x) it y) (by Remark 1 § 57) 

i<a * 

= {Ex){y){w = n p^i & {i) i<a Q(i, {w) u ( x) it y)) (cf. *91, *89). 

i<a 

The last expression is of the form ( Ex)(y)R(a , w, x, y) with recursive R. 

Proof of Theorem XI (a). For example, take l = m — k= 1 . The 
hypothesis that P is general recursive in Q means (§ 55) that P is general 
recursive in the representing function of Q. By Lemmas IV and V, 
ty(a)=w == (Ex)(y)R(a, w, x, y) with recursive R. We show first that 
P(a) is expressible in the 2-quantifier form with generality first. Using 
Theorem VI* (a) with its proof from Theorem IV* (7), 

P(a) = [t)T\$(t), g, a, t) = (/)(s)$(*)=s -* f}(s, g, a, *)] 

= $(s)[(£*)(y)fl(*, s. y) -> T}(s, g, a, 0] 

= (<)(s)(«)(Fy)[R(^ S, *, y) -> T}(s, g, a, /)] (cf. *96, *98) 

= W(Fy)[R(Wo, (x) v {x) 2 , y) T\{(x) v g, a, (*)„)] (by (18)), 

vhich is of the desired form. To express P(a) in the other 2-quantifier 
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form, we have similarly using Theorem IV* (6), 

P(a) s (£<)r}($(<), /, a, t) = (£*)(£ S )[M= S & T\(s, f, a, t)], etc. 

Corollary 0 . For each of the k -(-1 -quantifier forms, the predicate of 
Theorem V (b) which is expressible in the other k-\-\-quantifier form but 
not in the form itself is not recursive in predicates expressible in the forms 
with k {or fewer) quantifiers. 

We get a Theorem XI* (with corollary) by replacing "general re¬ 
cursive’’ for the forms of Theorem V (b) by "general recursive in T”. 

*§ 59. General recursive functions and the number-theoretic 
formalism. In this section, "(i)”—“(vii)” will refer to §41. 

We say that a number-theoretic predicate P(x v .... x n ) is resolvable 
in a formal system (or decidable within the system), if there is a numeral- 
wise decidable formula P(x 1 , ...,x n ) (cf. (iv)) with no free variables 
other than the distinct variables x x , ..., x n such that, for each «-tuple 
x v ..., x n of natural numbers, 

(viii) P(x v ...,*„) = |- P(x x , .... x n ). 

In this case, P(x 1 , ...,x n ) resolves P(x v .,.,*„) (with the obvious 
correspondence of formal to intuitive variables). 

A number-theoretic function <p(x v ..., x n ) is reckonable in a formal sys¬ 
tem (or calculable within the system), if there is a formula P(x 1( ..., x B , w) 
with no free variables other than the distinct variables x x , ..., x n , w 
such that, for each %, ..., x n , w, 

(ix) 9 (^ 1 * • • •» x n )~w == H P(x v • • •# x n , u?). 

In this case, P(x x , ..., x n , w) reckons cp(x x , ..., x„). 

Theorem 32. Let S be the number-theoretic formal system of Chapter IV 
{or Robinson’s system described in Lemma 18b § 49). If S is simply con¬ 
sistent, then : {9 is general recursive } = {<p is numeralwise representable in S} 
23 {9 is reckonable in 5}. 

Proof. We establish three implications. 

(a) 7/9 is general recursive, then 9 is numeralwise representable in S. 

By Theorem IX § 58, there is a number e for (29) and (30). 

Using (29) and the definition of T n preceding Theorem IV § 57, 

( 42 ) vyT '” * 1 - • • • > *», y)— w = 

Rn (?> %1> • • •» %n> w) & (■^)z<M)5'n (^> ^1> • • • > %nr %)• 

By Corollary Theorem 27 § 49, since S n {z, x v ..., x n , w) is primitive 



296 


GENERAL RECURSIVE FUNCTIONS 


CH. XI 


recursive, it is numeralwise expressed by a formula S(z, x 1( .. x n , w). 
We shall prove that the formula 

S(e, x v ..., x„, w) & Vz(z<w D ~iS(c, x 1( ..x n , z)), 

call it “Mfo, ...,x B , w)”, numeralwise represents 
\jyT n {e, x v ..., x n , y). Let x v ..., x n be any fixed n-tuple. To establish 
(v) for M, suppose that \iyT n (e, x 1 , ..., x n , y) — w. Then using (42), 
and (E) and (C) § 41 with (i) and (ii) for S, b M(x lf ...,x n ,w). Thus 
we have (v) for M. But then we also have (vi) by *174a (with w as 
the t). So (j iyT n (e, x v .... x n , y) is numeralwise representable. 

By Theorem 27, since U{y) is primitive recursive, it is numeralwise 
representable. 

By the reasoning for Case (IV) in the proof of Theorem 27, then the 
composite function U([iyT n (e, x v ..., x n , y)), i.e. (by (30)) <*>(%, ..., x n ), 
is numeralwise representable. 

In this proof, besides the predicate calculus, Axioms 14 — 21, the 
replacement property of equality, and results already employed in 
obtaining Theorem 27, we have used only (E) and *174a (with the 
numeral w as the t). Hence, using Lemmas 18a §41 and 18b §49, the 
implication (a) holds also for Robinson’s formal system. 

(b) If S is simply consistent, and <p is numeralwise representable in S, 
then 9 is reckonable in S (and any formula P which numeralwise represents 
9 reckons 9 (and by § 41 is numeralwise decidable)). 

Consider any fixed x v ..., x n . We have (v) by hypothesis. To establish 
the converse, assume that b P(x 1( ■ • x n , w). Now 
<f(x lt ..., x n )—w V <p(tc v . . x n )^w (cf. *158 §40). But if 
9 (%, ..., x n )^w, then using (vii) we contradict the simple consistency. 

(c.) If 9 is reckonable in S, then 9 is general recursive. 

Suppose <p(x v ...,x n ) is reckoned by the formula P(x x , ...,x„, w). 
If S is the full number-theoretic system, let Pf p be the primitive re¬ 
cursive predicate corresponding by the Godel numbering to (cf. 
Dnl 3 a § 51 and Lemma 19 § 52, or Theorem 31 § 52). If S is Robinson’s 
system (Lemma 18b), we must first make the appropriate changes in 
Dn 8 . Now (xj) ... (x n )(Ey)Pfp(x v ..., (y) 0 , (y) x ) and <p(x v ...,x n ) = 

{[LyPfp (x v ..., (y) 0 , (y)j)) 0 . Hence using Theorem III §57, 9 is 

general recursive. 

Corollary. Under the hypotheses of the theorem, and if 
(a x ) ... (x n ) [P(x x , ..., x n ) V P(x v ..., x n )]: [P is general recursive } = 
{P is numeralwise expressible in S} ss [P is resolvable in S}. 
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Proof. Let <p be the representing function of P. We establish four 
implications, 

{P is general recursive} ^ {<p is numeralwise re-presentable in 5} 

{P is numeralwise expressible in S} 

{P is resolvable in S (and any formula P which numeralwise expresses 
P resolves P)} 

{P is general recursive }, 

where the simple consistency of S is a further hypothesis for (f) and (g), 
and also (%) ... (x n ) [P{x v ..., x n ) V P{x v ..., *„)] for (f) (to make 
the proof of (f) valid intuitionistically). 

(d) From (a) by the definition of general recursive predicate. 

(e) As in the proof of Corollary Theorem 27 § 49. 

(f) We obtain (iv) as in § 41. We have (i) by hypothesis. To establish 
the converse, assume that (- P(x x , ...,x„). Now 

P(x lt ..., x n ) V P(x v ..., x n ). But if P(x v ..., x„), then using (ii) we 
contradict the simple consistency. 

(g) Suppose that the formula P(x 1 , ..., x n ) resolves P(x v ..., x n ). 
Similarly to (c), 

(*i) ... (x n ) (Ey) [Pf p (x v y) V P/_, P (x v ...,x n , y)] (since P is 

numeralwise decidable) and 

P{x v .. .,*„) = Pf P {x v .. .,x n ,y.y[Pf v {x v ..., x n ,y) V Pf^ P (x v .. .,x n ,y)]) 
(using (viii) and the simple consistency of S). (The simple consistency 
of S is implied by (viii), if {Ex-f) ... (Ex n )P(x v ...,x n ).) 

The equivalence of reckonability (resolvability) to general recur¬ 
siveness was proved by Mostowski 1947 for any simply consistent S 
containing the usual number theory and such that (R x ) primitive re¬ 
cursive predicates are resolvable in S (cf. Corollary Theorem 27) and 
(R 2 ) the predicates P/ A are primitive recursive (cf. Dnl3a and Lemma 19). 
Other references will be given in § 62. Cf. R. M. Robinson 1950 abstract. 

Let T be a list <J/ X , ..<pi of functions of m lt variables, re¬ 

spectively. Choose distinct predicate letters Q lf .... Q,, and annex 
them to the stock of formal symbols for 5. Extend the definition of 
formula for S by providing that Q^tj, ..., t TO/ , t) be a formula for each j 
(j = 1 , ...,/) and terms t v .... t mj , t. Let FqJ "£ be the set of the 
formulas Qj(y v ..., y mj , y) & Vz(Q,(y 1 , .... y mj , z) D y=--z) where 
+,(y v •... ymj) = y, for / = 1 , ..., Z and all m r tuples y v . ..,y mj of 
natural numbers. Let notions simply consistent (numeralwise decidable, 
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numeralwise expressible, numeralwise representable, resolvable, reckonable) 
in (or from) Y be obtained by changing "b” to “FqJ'''q| (-” ( or in 

words, "provable” to “deducible from in the former def¬ 

initions. Theorems 27, 31, 32 and corollaries (including (a) — (g)) hold 
with these notions and “Fq*"^ b”» "primitive recursive in Y” and 
"general recursive in Y” in place of the respective former notions. (Also 
P is numeralwise expressible uniformly in (or from) Y, if a formula P can 
be given independently of Y which numeralwise expresses P from Y 
for each choice of Y under consideration; and similarly for numeralwise 
representability, resolvability and reckonability. The theorems hold 
when the notions for P or cp relative to Y are understood throughout as 
uniform.) 

Equivalently, we may instead choose distinct function letters g x , ..., g,, 
extend the definition of term to include g j (t x , ..., t m .) when t x , ..., t m . 
are terms, and substitute "E^ 1 " ^ b” (§ 54) for “ b” in the former 
definitions. 

If we let Y include predicates, we may proceed in either of these 
ways, taking as < p, the representing function of Q, when the f-th of the 
Y’s is a predicate ; or instead for each such j we may introduce a pred¬ 
icate letter to be used with m } arguments, and let the assumption 
formula corresponding to this j and a given m r tuple y v ..., y mj be 
Qi(yv • • •. ym,) or -*Qi(y v ...,ym,) according as Q j {y 1 , . ..,y m ,) or 
Qi(yi . ymj)- 

§ 60 . Church’s theorem, the generalized Godel theorem. We 

now undertake to answer finally the question whether informal mathe¬ 
matics can be completely formalized (§ 15). We know by Godel’s theorem 
(§ 42) that the particular formal system of Chapter IV does not com¬ 
pletely formalize intuitive number theory. 

In informal number theory we consider propositions depending 
on parameters. Infinitely many particular propositions arise, according 
to the natural numbers taken as values by these parameters. We describe 
this situation by saying that the propositions are the values of predicates. 
Generally we have under consideration a number of predicates simul¬ 
taneously. However, let us discuss the formalization in relation to one of 
them, say a predicate P(x) of one variable x. 

The formal number-theoretic system of earlier chapters provides 
one illustration of how formalization can be carried out. Let us put 
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aside all the finer details, and consider only the bare skeleton of what 
a formal system must provide if it is to serve its purpose. 

The formal system must have some domain of ‘formal objects’. Among 
these, if the system is to constitute a formalization of the theory of the 
predicate P(x), there must be particular distinct formal objects which 
we identify as expressing the propositions P( 0), P( 1), P( 2 ), ..., i.e. 
P(x) for x = 0 , 1 , 2 ,.... We may conveniently designate these formal 
objects by “A(0)”, “A( 1)”, “A(2)”, ..., respectively, i.e. “A(x)” for 
x — 0 , 1, 2, ...; and call A(x) ‘the formula expressing P(x)’. We do 
this without making any assumption as to whether there be such things 
as numerals x, a variable x, and a formula A(x) from which A(x) comes 
by substituting x for x. 

Next, there must be a category of formal objects called ‘proofs’. 
Each proof must be a proof ‘of’ a particular formal object, which may 
or may not be the object A(x) for a given natural number x. Let “9f (x, Y)” 
stand for the metamathematical proposition that Y is a proof of A(x). 
The object A(x) is said to be ‘provable’, if there is a proof of it. Let 
“f- A(x)” stand for the proposition that A(x) is provable; i.e. 

(43) (£Y)9t(*, Y) = b A(x). 

What is the nature of the predicate Y) ? Our purpose in formal¬ 
izing a theory is to make explicit the conditions which determine what 
propositions hold in the sense of being provable in the theory (§ 15), or 
in brief to give an explicit definition of what constitutes a proof. This 
purpose will be accomplished, for the theory of the predicate P(x) as 
expressed by the formula A(x), only if there is a preassigned procedure, 
not requiring any mathematical invention on our part to apply, by which, 
whenever we are given a particular natural number x and formal object Y, 
we can tell whether Y is a proof of A(x) for that x. That is, there must 
be a decision procedure or algorithm for the question whether 9t(x, Y) 
holds (§ 30). We shall also express this by saying that 9f(#, Y) must be 
an “effectively decidable” metamathematical predicate. 

We have not stipulated what kind of structure the domain of the 
formal objects should have. However each formal object Y must be 
capable of being given as a finite object, as otherwise it would make no 
sense to speak of a decision procedure for 9 ?(*, Y). This may mean that 
each Y can be generated from a finite number of initial objects by a 
finite number of applications of recognized operations, as in our con¬ 
ception of a generalized arithmetic (§ 50); or that each Y can be given 
as a figure constituted out of a finite number of occurrences of symbols 
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from d preassigned enumeration of symbols. By familiar methods (§§ 1, 
50, 52) we can then give an effective enumeration of the formal objects, 
or perhaps more conveniently an effective Godel numbering, i.e. a 
1-1 correspondence between the formal objects and a subset of the 
natural numbers. The effectiveness means that, given any formal object 
Y we can always find the corresponding number y, and inversely given 
any natural number y we can always determine whether it corresponds 
to a formal object, and if so find that object Y. Thereby we correlate to 
the effectively decidable metamathematical predicate 9l(x, Y) an ef¬ 
fectively decidable number-theoretic predicate R(x, y), where R(x, y) = {y 
is the natural number correlated to a formal object Y such that 9t(x, Y)}. 
Then (EY) 8 t(x, Y) = (Ey)R(x, y) ; and by (43), 

(44) (Ey)R(x, y) = (- A(x). 

What kind of a number-theoretic predicate can meet the condition 
on R(x,y) of being effectively decidable? For the formal system of 
Chapter IV, and any formula A (a) in that system, this predicate R(x, y) 
(corresponding by the Godel numbering to ‘Y is a proof of A(x)’) is 
primitive recursive, as we showed in proving Theorem 31 § 52. 

Any general recursive predicate is effectively decidable. For any 
general recursive function <p is effectively calculable. Given a system E 
of equations defining 9 recursively, an effective process for finding the 
value of 9 for given arguments x v ..., x n is afforded by deducing equa¬ 
tions from E until one f(x 1 , ..., x n ) = x expressing that this value is x 
is found. Such a system E always does exist and such an equation can 
always be deduced, according to the definitions of ‘general recursive 
function' (§ 55) and ‘E defines 9 recursively’ (§ 54). (We can also see that 
9 is effectively calculable, by considering Theorem IX (29) and (30) 
§ 58, or Corollary Theorem IX.) Then if R is a general recursive predicate, 
we can decide whether it is true or false for given arguments x v ..., x n , 
by calculating the value of the representing function for those arguments, 
and reading whether that value is 0 or 1 . 

The converse of this has seemed also to be true. Every example of a 
function (predicate) acknowledged to be effectively calculable (de¬ 
cidable), for which the question has been investigated, has turned out to 
be general recursive. This heuristic evidence and other considerations 
led Church 1936 to propose the following thesis. 

Thesis I. Every effectively calculable function {effectively decidable 
; predicate) is general recursive. 

This thesis is also implicit in the conception of a computing machine 
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formulated by Turing 1936-7 and Post 1936 . We postpone the discussion 
of the evidence for the thesis to the next two chapters, and proceed at 
once to consider its implications. 

The thesis and its converse provide the exact definition of the notion of 
a calculation (decision) procedure or algorithm, for the case of a function 
(predicate) of natural numbers, which we were not ready to provide in 
§ 30. To give a decision procedure for a predicate P(x) thus means to 
give a general recursive predicate R(x) such that P(x) = R(x). By 
Theorem V (14) and (15) § 57, we have the following theorem, originally 
stated by Church 1936 with a different example of an “unsolvable” 
decision problem. 

Theorem XII. There is no decision procedure (or algorithm) for 
either of the predicates (y)T 1 (x, x, y) or (Ey)T 1 (x, x, y). 

From the considerations in the first part of this section, by applying 
Thesis I to the R of (44), we obtain a second thesis. 

Thesis II. For a given formal system S in which the values of a pred¬ 
icate P(x) are expressed by designated distinct formulas A(x) (x = 0 , 1 , 
2 , ...), the predicate 'A(x) is provable in S’ is expressible in the form 
(Ey)R(x, y) where R is general recursive, i.e. there exists a general recursive 
predicate R such that (44) holds. (Similarly for a predicate P(x v ..x n ) 
of n variables .) 

Remark 1 . Here we have considered only the formulas A(x) in 
which we are interested. If we take other formulas into account, and 
the system satisfies our usual conceptions of what it should accomplish, 
we can analyze further. Let “$(A, Y)” denote ‘Y is a proof of A' as a 
predicate of any two formal objects A, Y. This predicate $(A, Y) will 
be effectively decidable, A(x) will be an effectively calculable function of 
x, and the effective decidability of SR(*, Y) will result by taking 
9f(x, Y) = 3 p(A(x), Y). For an effective Godel numbering of the formal 
objects, let “A 0 ” stand for the object (if any) having a as its number; 
and let “ h A 0 ” express that this object is a provable formula (“h A a ” 
being false when a is not a Godel number). To $(A, Y) we correlate P(a, y). 
Using Thesis I: (a) There is a general recursive predicate P such that 
(Ey)P(a, y) = \- A a . (b) The Godel number a(x) of A(x) is a general 
recursive function of x. Thesis II follows by taking R(x, y) = P(<x(x), y). 

Thesis II formulates the (minimum) structural requirement on a 
formal system 5 in order for it to serve as a formalization of the theory 
of a predicate P, with the propositions P(x) for x — 0 , 1 , 2 , ... expressed 
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by the respective formulas A(x). As in the case of Thesis I, the converse 
holds (as stated below). 

It remains to state the conditions on 5 relating provability of the for¬ 
mulas A(x) with the propositions P(x) which they are intended to express. 
For the formalization to be correct (or consistent) for P(x), it is required 
that 

(45) |- A(x) -> P(x), 

i.e. A(x) is provable in S only when P(x ) is true. If the system S is also 
to constitute a complete formalization of the theory of the predicate, we 
must also have, conversely, that P(x) -> [-A(x), i.e. A(x) is provable 
whenever P(x) is true. Combining this with (45), 

(46) |- A(x) = P(x) 

in the case that S is complete as well as correct for P(x). 

Combining this with the structural requirement given in Thesis II: 
To give a correct formal system for the predicate P(x) entails finding a 
general recursive predicate R such that 

(47) (Ey)R(x, y) -> P(x); 
to set up one that is also complete, such that 

(48) ( Ey)R(x,y) ^ P(x). 

Now suppose P(x) is the predicate (y)T 1 (x, x, y). Then there is no 
general recursive R such that (48) holds for all x; for then by Theorem 
V (12), given any general recursive R, there is a number / such that (48) 
fails for x = /. Thus: 

Theorem XIII (Part I). There is no correct and complete formal 
system for the predicate (y)7\ (x, x, y). (A generalized form of Godel’s 
theorem, Kleene 1943 .) 

To examine the situation in more detail, consider any formal system 
S and choice of formulas A(x) in S to express (y)T l {x, x, y) for x = 0, 
1,2, .... Let R be the general recursive predicate such that ( 44 ) holds 
(given by Thesis II); and consider the / of (9) for this R. Suppose S is 
correct for (y)^#, x, y); then by ( 45 ), 

(49) bA(x) - {y)T l (x,x,y). 

By the* informal counterpart of *86 § 35, (9) and ( 44 ), 

(50) (y)Ti(f, f, y) = mTyU, f,y) = (Ey)R(f, y) ^TW)- 
Assume b A(f). Then by (4 9), (y)T x {f , /, y); and by (50), b A(/). By 
reductio ad absurdum, bA(f); and by (50), (y)T 1 (f, /, y). Thus: 
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Theorem XIII (Part II). In particular, suppose S is a formal system 
with, distinct formulas A(x) designated as expressing the propositions 
(y)Ti(x, x, y) for x = 0, 1, 2, .... Then a number f can be found such that : 
If S is correct for {y)T 1 {x, x, y), then {y)T 1 {f, f, y) & h Mf ); *•<?. the propo¬ 
sition (y)T x (f, f, y) is true, but the formula A(f) expressing it is unprovable. 

Thus no formal system can be complete for the purpose of proving the 
true (and only the true) propositions taken as the values of a certain pre¬ 
assigned intuitive predicate (yjT^x, x, y). Nothing is assumed about the 
formal system, except that it fulfils the structural requirement expressed 
in Thesis II, and yields only results correct under the interpretation of 
the formulas A(x) as expressing the values of the predicate (y)T x (x, x, y). 

These assumptions constitute a very considerable abstraction from 
the particular formal systems we have studied. In those, a proof consisted 
in applications of listed postulates. We had grounds for belief in the cor¬ 
rectness of each of these postulates separately, and hence in the system 
as a whole. The Godel incompleteness, we now see, does not depend on the 
nature of this intuitive evidence. 

To emphasize this, we can imagine an omniscient number-theorist. 
We should expect that his ability to see infinitely many facts at once 
would enable him to recognize as correct some principles of deduction 
which we could not discover ourselves. But any correct formal system 
for (y)T 1 (x, x, y), which he could reveal to us, telling us how it works 
without telling us why, would still be incomplete. 

To understand the meaning of the propositions {yfT-^x, x, y), only 
the notion of a particular effectively calculable predicate (indeed, of 
one which is primitive recursive), and of the universal quantifier used 
constructively, are required. Lesser conceptual presuppositions, if any 
mathematical infinite is to be allowed, are hardly conceivable. 

In using this predicate (y)T 1 (x,x,y) on the metatheoretic level, as 
already meaningful to us, we have not assumed that each value of it 
is either true or false. What we can conclude by only finitary reasoning 
with this predicate is enough, taken with Thesis II, to rule out the 
possibility of our ever having a correct and complete formal system for it. 

Here we have been dealing with the incompleteness of any (correct) 
formal system as a formalization of the existing intuitive theory of the 
predicate (y)r x (x, x, y). Because this interpretation has been handled 
in a finitary way, the theorem can be considered as metamathematical 
in the broader sense of the term. 

For formal systems having some ordinary formative and deductive 
properties, the theorem can also be formulated metamathematically in 
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the narrower sense, replacing the reference to the interpretation of the 
formulas A(x) as expressing the values of the predicate (y)T 1 (x,x,y) 
by consistency and completeness defined as intrinsic properties of the 
system; e.g. briefly thus: 

Theorem XIII (Part III). Let S be a formal system having distinct 
formulas T(x, y), -iT (x, y), A(x) (also written “Vy-i T(x, y) ”) and 
—iA(x) (also -written i Vy -iT(x, y)”) (x, y = 0 , 1,2, ...). Suppose that 

(A ) T 1 (x,x,y) b T(x, y) and T^x.x.y) h-iT (x, y), 

(B) (-Vy~iT(x, y) (y){ h ~ | T(x, y)}, and (C) for some general re¬ 
cursive R, (44) holds. Then a number f can be found such that : If S is simply 
consistent in the sense that for no x, y both b T(x, y) and (- -iT(x, y), 
then b A(f). If S is also o>-consistent in the sense that for no x both 
(y){ b ~iT(x, y)} and \- — i Vy -iT(x, y), then b “iA (f). Thus if S is simply 
and (o- consistent , it is simply incomplete in the sense that for some x neither 
b A(x) nor b ~iA(x). 

Proof. Assume that S is simply consistent, and (for reductio ad 
absurdum) that b A (f). Then by (44), ( Ey)R(f , y); by (9), (Ey)T x (f, /, y); 
and by (A), b T(f, y) for some y. But also from b A(/) by (B), b “»T (f, y) 
for this y, contradicting the simple consistency. By reductio ad absurdum, 
bA (f). Now assume that S is also co-consistent. From b A (f) by (50), 
(y)?\(/./. y); b y (A), (y){b-iT(f,y)}; and by the co-consistency, 
b -i Vy -iT (f, y), i.e. b i A (f). 

Mostowski 1952 , which was not available during the writing of the 
present book, compares various proofs of Godel’s theorem. 

The incompleteness theorems XII and XIII appear in our presentation 
as applications of the cases of Theorem V for the respective predicate 
forms R(x) and ( Ey)R(x , y). This theme is developed at length in Kleene 
1943 *. We conclude this section with some further remarks about the 
predicate form (Ey)R(x,y). 

Converse of Thesis II (Part I). There is a correct and complete 
formal system for any predicate of the form (Ey)R(x, y) with R general 
recursive. (Similarly for (EyP) ... (Ey m )R(x v ...,x n ,y v ...,y m ) with 
R general recursive (n, m>: 0).) 

In more detail (for n — m = 1 ): Given any general recursive predicate 
R(x, y), a formal system S with distinct formulas A(x) for x = 0 , 1,2,... 
can be found such that (44) holds. 

In Example 1 the correctness of the system remains a hypothesis. 
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However Example 3 provides a quick and unimpeachable demonstration. 
(Here (44) is the (viii) of § 59, but in general (iv) is lacking for resolva¬ 
bility.) 

Example 1 . By Corollary Theorem IV § 57, there is no loss of 
generality in taking R to be primitive recursive. Let 5 be the number- 
theoretic formal system of Chapter IV, R(x, y) be a formula which numer- 
alwise expresses R(x,y ) (Corollary Theorem 27 §49), and A(x) be the 
formula 3yR(x, y). Then clearly ( Ey)R(x , y) ->• (- A(x). So (44) holds, 
if the system has the property that b 3yR(x, y) only when ( Ey)R(x , y). 
This is a consistency property which (if (Ex)(Ey)R(x, y)) implies simple 
consistency, so by Godel’s second theorem (Theorem 30 § 42) we cannot 
expect to find an elementary proof of it; and in fact for the classical 
formal system it is not clear how any proof of it can be given without 
using classical logic in the metalanguage. 

Example 2 . Similarly taking as S Robinson’s system (Lemma 
18b § 49) with only thirteen number-theoretic axioms. A lengthy but 
elementary proof of the required consistency property (for an R(x, y) 
constructed as in the proof of Corollary Theorem 27) will be given in 
§§77—79 (cf. Theorem 53 (b) §79). — A fortiori (or directly, cf. Theorem 
53 (a)), Robinson’s system is simply consistent. 

Example 3. Let E be a system of equations in the formalism of 
recursive functions which defines recursively the representing function 
of R, with f as principal function letter. Let the predicate letter <-7 be 
added to the stock of formal symbols. Let S be the system having the 
equations of E as its axioms, and having as its rules of inference Rl, 
R2 and the following, where x and y are numerals: 

fjx,y)=0 
a{x). 

The consistency property (with b?(x) as the A(x)) is immediate. 

Example 4. Let the formal symbols of S comprise only 0, ', and 
the two predicate letters 2^ and El (with comma and parentheses). The 
formulas shall be the expressions RJ^x, y) and El{x) for x, y — 0, 1, 2, .... 
The postulates shall be an axiom schema 1 and a rule of inference 2, 
as follows. For Axiom Schema 1, x and y are stipulated to be numerals 
such that R(x, y). 

Kjx, y) 

El(x). 


l. K{x,y). 


2 . 
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This formal system may seem unorthodox. But the stipulation on the 
x and y for Axiom Schema 1 is an effective one (since R is general re¬ 
cursive), just as e.g. that on the t for Axiom Schemata 10 and 11 § 19. 

A fifth example is given as Example 2 end § 73. 

Converse of Thesis II (Part II). There is a formal system S such 
that, given any general recursive predicate R(x,y), distinct formulas A(x) 
for x — 0, 1,2, ... can be found such that (44) holds. (,Similarly for any n, 
m\ or for all n, m simultaneously.) 

Proof. Let A (z, x) (z, x — 0 , 1 , 2 , ...) be the formulas obtained by 
applying Part I (for n — 2, m = 1) to T^z, x, y); and let A(x) (x = 0, 
1,2_) be A (f, x) for the / of (9) § 57. 

Remark 2 . By adding new postulates to S (in Part I or II), we 
obtain systems S' such that ( Ey)R(x , y) -> f- A(x), but not necessarily 
t-A(x) -> (Ey)R(x,y). 

By Thesis II and its converse, the same predicates are expressible in 
the form ( Ey)R(x, y) for some general recursive R as are expressible by 
|- A(x) for some formal system S and effective designation of formulas 
A(x) for x = 0, 1,2, .... Briefly, the predicate form (Ey)R(x,y) coin¬ 
cides- with the notion of provability in some formal system. 

By the result mentioned at the end of § 53 (with Corollary Theorem IV) 
inductive definitions (with constructive direct clauses) lead to precisely 
the same class of predicates. This fact is closely related to the foregoing, 
in view of the role often given to inductive definitions in defining formal 
systems. 

Recursive enumerability. A set or class C of natural numbers is 
recursively enumerable, if there is a general recursive function 9 which 
enumerates it (allowing repetitions), i.e. such that 9 ( 0 ), 9 ( 1 ), 9 ( 2 ), ... is 
an enumeration (allowing repetitions) of the members of C. (Post 1944 
includes also the empty class as recursively enumerable.) 

Theorem XIV. A class C having a member is recursively enumerable, 
if and only if the predicate x £ C is expressible in the form (Ey)R(x, y) with 
a general recursive R. 

In more detail: (a) If 9 enumerates C, then xEC = ( Ey)R(x,y ) with 
an R primitive recursive in 9 . (b) If x&C = (Ey)R(x, y) and C has a 

member m, then C is enumerated by a function 0 primitive recursive in R 
(Kleene 1936 ). 
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Proofs, (a) xt C= [ Ey )[^{ y )— x ] (cf. #14 §45). 


(cf. ##D, F, 19). 



( y)o if R ({ y ).. (y)i). 
w if R((y) 0 , (y) x ) 


(b) Let 


Thus Thesis II is equivalent to saying that the class C of the numbers 
x for which A(x) is provable is recursively enumerable (if it has a member). 
We call a set or class C general recursive, if the predicate x S C is general 
recursive. Paraphrasing results of § 57 : A general recursive class C is a 
fortiori recursively enumerable, if it has a member (Theorem VI (a)); 
likewise its complement C (#D § 45 ). Classically, a class C is general 
recursive, if both C and C are recursively enumerable (Theorem VI 
(b) or (c)). The class of the x’s such that (EyjT^x, x, y) (in symbols, 
^(£y)r x (x, x, y)) is recursively enumerable (by ( 29 ) § 58 , (Ey)T 1 (e, e, y), 
so e is a member), but not general recursive (Theorem V ( 15 )); its com¬ 
plement x^T^x, x, y) is neither recursively enumerable nor general 
recursive (Theorem V (12) and ( 14 )). 

Corollary. If a class can be enumerated (allowing repetitions) by a 
general recursive function, it can be enumerated {allowing repetitions) by 
a primitive recursive function. (Rosser 1936.) 

Using (a), then Corollary Theorem IV, then (b). 

Example 5 . An infinite class C is general recursive, if and only 
if it is recursively enumerable without repetitions in order of magnitude 
(Kleene 1936). (Hint: Use Theorem III.) Every infinite recursively 
enumerable class contains an infinite general recursive subclass (Post 
1944). If an infinite class is recursively enumerable allowing repetitions, 
it is recursively enumerable without repetitions (Kleene 1936). 

Example 6. Problem: to define constructively (i.e. intuitionistically) 
‘general recursive function’ from 'recursively enumerable class’. We must 
avoid the application of the law of the excluded middle which occurs in 
saying that a class is either empty or has a member, and the non-in- 
tuitionistic steps in the proof of Theorem VI (b). Solution by combining 
the first of the following two propositions, Theorem VI (c), and the second 
(or Theorem XIV): A function <p(x x , ..., x„) is general recursive, if and 
only if the class of the numbers 2 <p(xi x,,) ■ p\ x ■... • p x ” is general re¬ 
cursive. A predicate x 6 C is expressible in the form [Ey)R{x, y) with 
general recursive R, if and only if the class (call it {0}+C’) consisting of 
0 and the successors of the members of C is recursively enumerable. 
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We obtain a notion ‘enumerable recursively in Y’ by reading “function 
9 general recursive in Y” in place of "general recursive function 9 ” in 
the definition of recursive enumerability. The results extend to this. 

§ 61. A symmetric form of Godel’s theorem. Theorem XIII § 60 
generalizes Theorem 28 § 42 (Godel’s theorem in the original form), 
with A (f) corresponding to the A p (p). Part III of Theorem XIII was 
formulated metamathematically in the narrower sense. We shall now 
take up thus a generalization of Theorem 29 (the respective Rosser form 
of Godei's theorem). 

In place of (y)Ti(^, x, y), we now use the slightly more complicated 
predicate {y)\Ti((x) v x,y)_V {Ez^yT^x)^ x, *)], or its equivalent 
(Ey)[Tj{(x) lt x , y) & {z) zSy T x {{x ) 0 , *, z)]. 

Let “W 0 (x,y)” abbreviate T^x)^ x, y) & (z) z < y T 1 ((x) 0 , x, z) and 
“W^x, y)” abbreviate T^x)^ x, y) & {z) z < y T l {{x) 1 , x, z). 

Let x be fixed. Suppose there is a number y 1 such that (i) 7 \((;*;) 0 , x, y x ) 
and (ii) {z) z < Vl Ti{{x)i, x, z). Then there can be no number y 0 such that 
(iii) T 1 {{x) 1 ,x,y 0 ) and (iv) (*),<*?!((*)„,*,*). For (i) and (iv) imply 
y 1 > y 0 , and (ii) and (iii) imply y 0 > y v Thus 

(51) (Ey)W 1 (x, y) -> {Ey)W 0 {x, y). 

Since the predicates W 0 (x, y) and W^x, y) are primitive recursive 
(using ##A, C, D, E, 19 § 45), the theory of the predicate (Ey)W 0 {x, y) 
can be completely formalized, and likewise at least as much of the theory 
of {Ey)W^{x, y) as is given by the sufficient condition ( Ey)W 1 {x , y) of (51) 
(by the Converse of Thesis II and Remark 2 § 60). We shall now show 
that a formal system S which formalizes at least this much, if consistent, 
cannot be complete. 

Accordingly let S be any formal system in which there are formulas 
B(x) and ~iB(x) for x = 0, 1,2, . .., all distinct. We shall not make 
any restrictive assumptions as to what kind of symbolism 5 has, or in 
particular that B(x) comes from a formula B(x) by substituting a numeral 
x for a variable x, or that -iB(x) comes from B(x) by prefixing a certain 
symbol - 1 . The deductive rules of 5 shall be such that 

(52) (Ey)W 0 (x,y) bB(x), (53) (Ey)W 1 (x, y) - h iB(x). 

The system S is to serve the purpose of giving an explicit criterion of 
what constitutes proof for the formulas B(x) and -iB(x), and hence 
(although we now avoid specifying that B(x) and ~iB(x) should express 
certain predicates) it is demanded as before by Thesis II that there exist 
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general recursive predicates R 0 (x, y ) and Ri(x, y) such that 

(54) (Ey)R 0 (x, y) = b B(x). (55) {Ey)R x {x,y) = b iB(x). 

By the {simple) consistency of S we shall mean that for no natural number 
x, both b B(x) and b ~>B(x); and by the {simple) completeness that 
for every x, either b B(x) or b ~iB(x). 

Theorem XV. There is no simply consistent and complete formal 
system satisfying (52)—(55). 

In more detail: Given any formal system S with distinct formulas B(x) 
and -iB(x) {x = 0, 1 , 2 , ...) and general recursive predicates R 0 {x,y) 
and R x {x, y) such that (52) — (55) hold, a number f can be found such that, 
if Sis simply consistent, then neither b B(f) nor b ~>B(f). (Rosser’s form of 
Godel’s theorem, in a generalized version.) 

Proof. Assume that S is simply consistent, or in symbols 

(56) b B(x) & b -iB(x). 

By Theorem IV ( 6 ) § 57, there are numbers / 0 and f x such that, if 
we put / = 2 /o -3 /l , then 

(57) {Ey)R 0 {x, y) = {Ey)T x {f 0 , x, y) = (. Ey)T x ({f) 0 , x, y), 

(58) (Ey)R 1 {x,y) = {Ey)T x {f x , x,y) = {Ey)Ty[{f) lt x, y). 

In the rest of the proof, each time we use (52) — (58), we substitute the 
number / for the variable x. To show by reductio ad absurdum that 
b B(f), suppose that 

(a) b B{f). 

Then by (54), ( Ey)R 0 {f, y); and by (57), 

(b) 0 Ey)TMo>f>yY 

Also by (a) and the simple consistency ((56)), 

(c) _ F^TW). 

Hence by (55), {Ey)R 1 {f, y); by (58), {Ey)T x {{t) x , f, y); whence 

(d) {y)fM)v t> y). 

By (b) and (d), (Ey)[T x {{f) 0 , f, y) & {z) z ^T x {(f) x , f, *)], i.e. {Ey)W x {f, y); 
and by (53), b ~>B(f), contradicting (c). Hence, rejecting the assumption 
(a) by reductio ad absurdum, b B(/). 

By similar steps, or simply by observing the symmetry between 
(52), (54), (56), (57) and (53), (55), (56), (58): b -iB(f). 
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Discussion. We motivated the conditions for S by suggesting that 
B(x) should express (Ey)W 0 (x, y) and ~i B(x) its negation. Let us call 
this, for convenience, the preferred interpretation. Under the preferred 
interpretation, -iB ( f ) corresponds to the A q ( q ) of §42 and expresses a 
true proposition. However the preferred interpretation is not mentioned 
in the theorem itself. The conditions for S are entirely symmetrical. 
There is nothing to keep us equally well from interpreting -iB(x) as 
expressing {Ey)W 1 {x, y) and B(x) the negation of that. Then B(/) cor¬ 
responds to A q { q ) and expresses a true proposition, while -iB ( f ) is false. 
Between these extremes, there are many intermediate possibilities for the 
interpretation. We shall illustrate this further in connection with the 
following examples of systems S for Theorem XV. 

Example 1. In the number-theoretic formalism of Chapter IV, 
since T 1 ((a) 1 , a, b ) and T 1 ((a) 0 , a, c) are primitive recursive, by Corollary 
Theorem 27 § 49 they are numeral wise expressed by formulas A(a, b) 
and B {a, c), respectively. Let B(x) be 3/>[A(x, b) & Vc(c<,b 3 -iB(x, c))], 
and ~iB(x) be -i3£[A(x, b) & Vc(c<,b 3 -iB(x, c))]. Then (52) can be 
shown to hold by methods used in the first part of the proof of Theorem 
29 § 42. Also by steps shown there and 3-introd., 
h B(x) 3 -iVi>[-iA(x, b) V3 c(c<b & B(x, c))]. Contraposing (by *13), 
u vi>[-iA(x, b) V 3c(c<i» & B(x, c))] 3 -iB(x). This with the method 
of the second part of the proof of Theorem 29 gives (53). Note that this 
proof requires, besides the predicate calculus with equality, Axioms 
14 — 21 and Corollary Theorem 27, only *166a, *168, *166 and *169 
(with t a numeral). Hence by Lemma 18a (end §41) and Lemma 18b 
(end § 49), it also holds good for the system of Robinson. For the 
number-theoretic system of Chapter IV, we have (54) and (55) for some 
recursive R 0 and R v by Theorem 31 § 52 (and for Robinson’s system, 
by the method of the proof of Theorem 31). In this example, the -> of 
~iB(x) for Theorem XV is actually the -i of the number-theoretic for¬ 
malism ; and under the usual interpretation of the number-theoretic sym¬ 
bolism, B(x) and ~iB(x) have the preferred interpretation. Now let S 
be the number-theoretic system of Chapter IV (or Robinson’s), with 
this choice of B(x) and -iB(x). By the theorem, S (if it is simply con¬ 
sistent) is simply incomplete, and so is every simply consistent en¬ 
largement -of 5 obtained by adding more postulates (in such a way that 
(54) and (55) still hold for some recursive R 0 and R 1 ). Such an enlargement 
of S may even be at variance with the preferred interpretation of B(x) 
and ~iB(x) in S, provided only that the new postulates do not conflict 
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with the preferred interpretation in a sufficiently elementary way as to 
give rise to a simple inconsistency. 

Example 2 . Let the symbolism for a formal system S include the 
numerals and four predicate symbols W Q , W v S and -iS (or in place of 
the last, an operator -i). Let S have as its postulates two axiom schemata 
and two rules of inference, as follows. For Axiom Schema 1 , x and y 
are numerals such that W 0 (x, y), and for Axiom Schema 2 such that 
W l (x. y). 

1 . W 0 (x,y). 2 . W^x.y). 

3 Y ± (x ’ 4 r i( x - y) 

S(x). ' -iB(x). 

The simple consistency of this system is immediate from (51). This 
system S, or any simply consistent extension of it obtained by adding 
more postulates (with (54) and (55) remaining true for some recursive 
R 0 and R x ) is simply incomplete. In this S the formulas B(x) and iB(x) 
are provable only as required by (52) and (53). We are unrestricted by 
the interpretation in enlarging S. 

Theorem XI IT § 60 is the case of Theorem V § 57 for the predicate form 
(Ey)R(x, y), in a metamathematical application. Likewise Theorem XV 
admits a version entirely in terms of predicate forms. Let us compare 
Theorems XIII and XV using the language of recursively enumerable 
classes (cf. Theorem XIV § 60). 

Remark 1 . Theorems XIII and XV make it absurd that any of 
the three classes ^(EyjT^x, x, y), x{Ey)W (i {x > y) and x{Ey)W l {x, y) be 
empty. — Members can be found thus. Using (29) Theorem IX § 58, 
(Fy)7' 1 (<?, e, y). Choose any recursive R such that (x)(Ey)R(x, y), and 
choose / for this R by (6) Theorem IV § 57; also choose any recursive R 
such that (x)(y)R(x, y), and choose g for this R by (7). Let e 0 = 2°-3 > 
and e t = 2 f -3°. Then {Ey)W 0 (e 0 , y) and (Ey)W 1 (e v y). 

In Theorem XIII we have a fixed recursively enumerable class C 0 
of natural numbers (namely x(Fy)7\(x, x, y)) whose complement C 3 
(= x(y)T 1 (x, x, y)) is not recursively enumerable (Figure 1 ). In Theorem 
XV we have two fixed recursively enumerable classes C 0 and C x (namely, 
x(Ey)W^x, y) and x{Ey)W 1 (x,y), respectively), which are disjoint (by 
(51)) and such that, for every separation of all natural numbers into two 
disjoint classes C 2 and C 3 with C 0 C C 2 and C l C C 3 , the classes C 2 and C 3 
are not both recursively enumerable (Figure 2). (Instead of (x) (x £ C 2 V 
x £ C 3 ) it suffices to have (x)(x £ C 2 V x £ C 3 ), which is weaker intuition- 
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istically.) Only for the preferred interpretation is C 3 the complement of 
C 0 . In proving Theorem XIII (Part II), we supposed given any recursively 


C n 


C a 


Figure 1. 


c. 


c % 



Figure 2. 


enumerable class D 3 contained in C 3 (namely, x(Ey)R(x, y)), and found a 
number / contained in neither C 0 nor D 3 (Figure la). In proving Theorem 
XV, we supposed given two disjoint recursively enumerable classes D 2 
and D z containing C 0 and C v respectively (namely, x{Ey)R 0 (x, y) and 
x(Ey)R 1 (x, y)), and found a number / contained in neither Z ) 2 nor D s 
(Figure 2 a). The metamathematical phraseology in the above proof can 


of course be bypassed, by first using (54) and (55) in (52), (53) and (56), 
so that the hypotheses become 

(52a) (Ey)W 0 (x,y) -+ {Ey)R 0 {x,y), (53a) (Ey)W 1 {x,y) -> (Ey)R 1 (x,y), 

(56a) {Ey)R 0 (x, y) & (Ey)R 1 {x, y) 

(Kleene 1950 ). These results extend to classes enumerable recursively in 
; cf. end § 60 and Theorem X § 58. 

The numbers x for which B(x) is provable are recursively enumerable 
((54), Theorem XIV; by (52) and Remark 1 , |- B(e 0 )). 

Theorem XVI. If S as described in Theorem XV (omitting (55)) is 
simfly consistent, the numbers x for which B(x) is unprovable in S are 
not recursively enumerable, or equivalently there is no general recursive 
predicate Q(x, y) such that (Ey)Q(x, y) = \- B(x). (After Rosser 1936 .) 

Proof. For if #[|-B(x)] were recursively enumerable, then taking 
C 2 = x[y B(x)] and C 3 = x[\- B(x)], we would have the situation shown 
in Figure 2 with C 2 andC 3 both recursively enumerable. (Intuitionistically, 
(x) (x £ C 2 V x 8 C 3 ); cf. *51 a § 27.) The proof may also be given by observ¬ 
ing that (52) — (56) would hold replacing “R 1 ” by “Q” and “ (- -iB(x)’' 
by " h B(x)”, whereupon the former proof that j- B (/) & |- “>B ( f ) becomes 
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Figure la. 
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a deduction of the logical contradiction |- B(/) & b B(f). (We use the 
hypotheses regarding -iB(x) and the simple consistency only to infer 
that (Ey)W 1 (x,y) -> b B(x).) 

The decision problem for a formal system (cf. §30). By Thesis I 
§ 60, ‘effectively decidable (calculable)’ means ‘general recursive’ in the 
case of a number-theoretic predicate (function). To give an exact sense to 
‘effectively decidable (calculable)' for a metamathematical predicate 
(function), we can require that the corresponding number-theoretic 
predicate (a corresponding number-theoretic function) be general re¬ 
cursive, in the case of any particular formal system S, the objects of 
which admit an effective Godel numbering (as they must, if S is to serve 
our purpose in formalizing, beginning § 60). For example, to give a 
‘decision procedure’ for provability in S, i.e. for the predicate 1- A (where 
A is a metamathematical variable ranging over all formulas, or over all 
formal objects, of S), then means to give a general recursive predicate 
R(a) such that R(a) = b A a , in the notation of Remark 1 § 60. (A second 
method of making ‘effectively decidable (calculable)’ exact for a meta¬ 
mathematical predicate (function) is indicated at the end of § 70.) The 
following can be read either in terms of our intuitive conception of a 
decision procedure applying to formulas of S, or in terms of this exact 
mathematical definition. The condition that B(x) be an effective meta¬ 
mathematical function of x (or that its Godel number (3(*) be a general 
recursive number-theoretic function of x) must be met, if S is to serve 
the purpose of formalization for the formulas B(x). 

Corollary. Let S be as described in Theorem XV ( omitting (54) and 
(55)), and such that B(x) can be effectively found from x {or that in some 
specified effective Godel numbering, its Godel number $(x) is a general re¬ 
cursive function of x). If S is simply consistent, then its decision problem is 
unsolvable, i.e. there is no decision procedure for-determining whether a 
formula is provable in S. 

Proof. For if there were a method for determining effectively 
whether any given formula of the system is provable, one could, given 
any number x, find the corresponding formula B(x) and then apply the 
method to that formula. By Thesis I § 60, this would imply that the class 
x[bB(x)] is general recursive. Then a fortiori the classes x[ b B(x)] and 
£[b B(x)] would both be recursively enumerable, and hence there would 
be general recursive predicates R 0 and Q such that ( Ey)R 0 {x, y) = b B(x) 
(for (54)) and {Ey)Q{x,y) = b B(x) (cf. end §60, and Remark 1), con- 



314 


GENERAL RECURSIVE FUNCTIONS 


CH. XI 


tradicting the theorem. — Otherwise stated: Were there a general re¬ 
cursive R such that R(a) = 1 - A tt , then taking R 0 (x, y) == R($(x)) and 
Q(x, y) = R(fi(x)), we would contradict the theorem as above. 

Theorem 33. If the number-theoretic formal system of Chapter IV 
(or Robinson’s system described in Lemma 18b §49) is simply consistent, 
then its decision problem is unsolvable, and remains unsolvable when the 
system is extended by adding postulates in any way such that the system 
remains simply consistent. 

By the corollary with Example 1 . Since in Example 1 the -i of -iB(x) 
is the -i of the number-theoretic system, ‘simple consistency’ as used in 
Theorem XV coincides with ‘simple consistency’ as defined (in § 28) for 
the number-theoretic system. (For the definition of ‘decidable’ by use 
of a Godel numbering, it may be understood that the numbering is that 
of §§ 50, 52. Using Example 2 § 52, then (}(*) is primitive recursive.) 

Reductibility, degrees of unsolvability. Much of the work on de¬ 
cision problems is devoted not to outright solutions but to reductions 
of one decision problem to another. 

To ‘reduce’ the decision problem for a predicate P (or the calculation 
problem for a function 9) of « variables to the respective problems for 
l functions and predicates , +1,. Qi> • • •» Qi 2 (abbreviated T) means 

intuitively to find a uniform method of procedure by which, given any 
w-tuple of arguments x v ..., x n , one could decide whether or not 
P(x v ...,%„) is true (calculate the value y(x v ..., *„)), if, at each stage 
of the procedure, he had available the values of the functions ^1, ..., 4> tl 
and the truth or falsity of the values of the predicates Q v ..., Q lt , for 
such arguments as he might then name; or briefly, to establish that 
P is effectively decidable (9 is effectively calculable) from T. 

To obtain a precise mathematical notion to correspond to this intuitive 
notion, we naturally extend Church’s thesis (Thesis I § 60) to include the 
case of / > 0 assumed functions and predicates T (calling it then Thesis 
I*). The evidence for Thesis I will also apply to Thesis I*. The converse 
of Thesis I* holds. 

For example, to reduce the decision problem for a predicate P(a) 
to that for another predicate Q(a) now means to find a predicate R(a) 
general recursive in Q(a) and such that P(a) = R(a), or briefly to establish 
that P(a) is general recursive in Q(a). We infer this from our intuitive 
notion of reduction by Thesis I*; or if we take it as a definition, we appeal 
to Thesis I* in asserting that the defined notion agrees with our intuitive 
conception of reduction. 



§61 


A SYMMETRIC FORM OF GODEL’s THEOREM 


315 


Post 1944 formulated several mathematical reducibility concepts. 
The most general of them, which Post takes from Turing 1939 , is equiva¬ 
lent to the notion we obtain from Thesis I*. If the decision problem for 
P(a) is reducible to that for Q{a)„ and is unsolvable, Post says further 
that it is of equal or lower degree of unsolvability than that for Q{a) ac¬ 
cording as the decision problem for Q (a) is or is not reducible to that for 
P(a) (cf. § 3 ). The degrees of unsolvability are at least partially ordered 
(cf. § 8 ). Each of the predicates (Ex)T x (a, a, x), (x)(Ey)T 2 (a, a, x, y), 
(Ex)(y)(Ez)T 3 (a, a, x, y, z), ... (cf. Theorem V Part II (b) §57) has a 
decision problem of highest degree of unsolvability for predicates of the 
respective forms (Ex)R(a,x), (x)(Ey)R(a,x,y), (Ex)(y)(Ez)R(a,x,y,z),... 
with R recursive, as we shall prove later (Example 2 §65; this holds 
intuitionistically, if taken to mean simply that each predicate of the 
respective one of the forms with a recursive R is recursive in that 
non-recursive predicate, which is what is directly proved). Classically 
then each of these predicates after the first has a decision problem of 
higher degree of unsolvability than the preceding (using also Corollary 
Theorem XI § 58), and the predicate M(a, k) of Theorem VIII § 57 has 
one of still higher degree of unsolvability (by Theorem X Corollary (b) 
§ 58 and the proof of Theorem VIII). Using T% instead of T n , we obtain 
degrees of unsolvability ascending from that (by Theorem XI* end § 58 
and Example 2 §65 for l > 0 ). Davis (1950 abstract) explores those 
degrees of unsolvability. Post 1944 raises, without answering, the 
question whether there exists a lower degree of unsolvability than 
that of the decision problem for (Ex^^a, a, x). 

Example 3. Let Theorem 33 be restated using the stronger con¬ 
sistency property of Example 1 (or 2) § 60 in place of simple consistency. 
In this weaker form Theorem 33 can be proved from Example 1 (or 2 ) § 60 
by taking the R(x, y) there = T x (x, x, y) and using Theorem XII § 60. 
From this proof it follows, by the results of Example 2 § 65 just cited, 
that, under the stronger consistency hypothesis, the decision problem for 
the formal system of Chapter IV (or Robinson’s, Lemma 18b), or for any 
extension having that consistency property for the same or some other 
R(x, y) which numeralwise expresses R{x,y) (= T x {x, x, y)), is of the 
highest degree of unsolvability for 1 -quantifier predicates. — For the 
system of Chapter IV (or Robinson’s), we can also show this (with 
another choice of R(x, y) and R(x, y) in the stronger consistency hy¬ 
pothesis) from the proof of Theorem 33 as given, thus. Let / be chosen 
by ( 6 ) with T 1 ((x) 2 , (x) 2 , y) as the R, and g be chosen by (7) for any recur- 
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sive R such that (x)(y)R(x, y). Then (EyjT^x.x.y) == (Ey) W Q (2 Q -y-5 x , y). 
This reduces the decision problem for (£y)7' 1 (* > x, y) to that for 
{Ey)W 0 (x, y). So the decision problem for the system is of the highest 
degree for 1-quantifier predicates, if the system has the property that 
the converse of (52) holds for the B(x) of Example 1 (as Robinson’s 
system does, by Theorem 53 (c) § 79). (This B(x) is of the form 3yR(x, y) 
where R(x, y) numeralwise expresses W 0 (x, y), by § 41 (C) and (E).) 

In the case T are unspecified functions and predicates, one may discuss 
the reduction of the decision problem for P (the calculation problem for <p) 
to the respective problems for Y in the sense of obtaining a procedure 
uniform in Y as well as in x v ..., x n ; or briefly, of establishing that P 
is effectively decidable (9 is effectively calculable) uniformly from Y. 
Then Thesis I* and Converse are to be stated reading "uniformly” in 
hypothesis and conclusion. 

Note i. Proof of (a) bottom p. 292 should be amplified as follows. The application 
of Corollary Theorem I* leads to (I) an expression for the predicate R in terms of 
the logical operations of the predicate calculus with number variables, 0, + , *, — 
and the representing functions i|>i, 4i °f the predicates Y. By the method of 
proof of Theorem I* and Corollary, each of 4i> ..., 4* enters into that expression 
only in parts of the form “4(ai, ..., a m ) = w”. Each such part can be replaced by 
t ‘{Q{a v ..., a m ) Sc w = 0} V {Q(a v ..., a m ) Sew — 1}” where Q is the predicate re¬ 
presented by 4- Thus we obtain (II) an expression for the predicate R in terms of the 
logical operations of the predicate calculus with number variables, 0, +, •, = and 

the predicates T themselves. Now we are ready for the final step (p. 292). — 
Actually, an expression (I) exists (which is what *R is arithmetical in Y' means for 
predicates Y, under our definition p. 239 with bottom p. 291) if and only if an ex¬ 
pression (II) exists. Why? 
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§ 62. Church’s thesis. One of the main objectives of this and the 
next chapter is to present the evidence for Church’s thesis (Thesis I 
§60). 

Since our original notion of effective calculability of a function (or 
of effective decidability of a predicate) is a somewhat vague intuitive 
one, the thesis cannot be proved. 

The intuitive notion however is real, in that it vouchsafes as effectively 
calculable many particular functions (§ 30), and on the other hand 
enables us to recognize that our knowledge about many other functions 
is insufficient to place them in the category of effectively calculable 
functions. 

For an example of the latter, let R(x, y) be an enecuvely decidable 
predicate, and consider the function eyR(x, y) (Godeli93i) defined clas¬ 
sically thus, 

D/ . f the least y such that R(x, y), if ( Ey)R(x, y), 

*yR(*. W = } o, otherwise. 


This definition does not (of itself) provide a calculation procedure. Given 
x, we can search through the propositions R(x, 0), R(x, 1), R(x, 2), ... 
in succession, looking for one that is true, as far as we please; i.e. we can 
in principle complete the examination of the first n of them, for any finite 
». If the given x is such that ( Ey)R(x , y), by persisting long enough we 
shall eventually encounter a first y for which R(x, y) is true, which y is 
the value of the function eyR(x,y). But if x is such that (Ey)R(x, y), 
we shall never learn this by persisting in the search, which will remain 
forever uncompleted. The completion of the examination of all Kq 
propositions, which the classical definition envisages, is impossible for 
a human computer. 

For some choices of R(x,y), the function tyR(x,y) may nevertheless 
be effectively calculable, not “immediately” on the basis of its definition, 
but because of the existence of some other procedure for determining the 
value, which unlike the one suggested by the definition itself is effective. 
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For example, when R(x, y) = (x) 0 (y) 0 - J t -(x) 1 (y) 1 =(x) 2 , there is known 
to be such a procedure (cf. Example 2 § 30). 

The function syR(x, y) is effectively calculable, if and only if the 
predicate ( Ey)R(x , y) is effectively decidable. For if (Ey)R(x, y) is effec¬ 
tively decidable, then given x, to calculate zyR(x, y), we can first decide 
whether or not (Ey)R(x, y) is true, and according to the answer either 
search for the least y such that R(x, y) or take 0 as the value. Conversely, 
if zyR(x, y) is effectively calculable, then given x, we can decide whether 
or not ( Ey)R(x, y) is true, by first calculating zyR(x, y), and then ascer¬ 
taining whether or not R(x, zyR(x, y)) is true. 

The intuitionist finds no justification for the belief that we can always 
tell, for a given predicate P(y), whether or not ( Ey)P(y). This is his 
ground for not accepting the law of the excluded middle A or not A with 
his meaning of "or” (§ 13). His argument, applied to R(x, y) as the P(y), 
is an argument that we have no basis to suppose that, for any R, the pred¬ 
icate (Ey)R(x, y) is effectively decidable. 

Church's thesis, by supplying a precise delimitation of ‘all effectively 
calculable functions', makes it possible to prove, for certain predicates 
R(x, y), e.g. T^x, x, y) (Theorem XII § 60), that there is no uniform method 
of solving the problem whether or not (Ey)R(x, y). Thereby Brouwer’s 
argument, that Hilbert’s belief in the solvability of every mathematical 
problem is unproven, is now strengthened to an actual disproof, when 
solvability is taken to mean uniform solvability and Church’s thesis is 
accepted. The relationship of Church’s thesis to intuitionism will be 
discussed further below (§ 82). 

The intuitionist does not regard the definition given above for eyR(x, y), 
lacking a proof of effective calculability, as properly defining a function. 
But our discussion of eyR(x, y) can refer intuitionistically to the predicate 
{/?(%, w) & (z) z<w R(x, z )} V {(Ey)R(x, y) & w=0}. Classically, this pred¬ 
icate, call it “P(x,w)”, is the representing predicate of syR(x,y). But 
intuitionistically, we may not be able to prove that (x)(E\w)P(x, w), 
i.e. that P(x, w) is a representing predicate of a function (cf. § 41; 
(x)[(Ew)P(x, w) = (E\w)P(x,w)], cf. *174b, *171). 

While we cannot prove Church’s thesis, since its role is to delimit 
precisely an hitherto vaguely conceived totality, we require evidence 
that it cannot conflict with the intuitive notion which it is supposed to 
complete; i.e. we require evidence that every particular function which 
our intuitive notion would authenticate as effectively calculable is general 
recursive. The thesis may be considered a hypothesis about the intuitive 
notion of effective calculability, or a mathematical definition of effective 
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calculability; in the latter case, the evidence is required to give the 
theory based on the definition the intended significance. 

The converse of Church’s thesis, i.e. that every general recursive 
function <p is effectively calculable, we take to be already confirmed by 
the intuitive notion (cf. § 60). We use here the definition of ‘E defines 
9 recursively' which says that, given x v . .., x n , a deduction from E of 
an equation f(x 1( ..., x„)=x expressing that the value <p(x v ...,x„) 
is x always exists (or if we base the computation procedure on Theorem 
IX (30), we use (29); or if we base it on Corollary Theorem IX, we use ( 1 )). 
In concluding that we have an effective computation procedure, the 
existential quantifier which appears in the definition of ‘E defines 9 recur¬ 
sively’ (or in (29), or in ( 1 )) must be understood constructively (§ 13); 
and likewise the existential quantifier in the definition of ‘9 is general 
recursive’, which states that there exists an E defining 9 recursively (or 
in Theorem IX, that a Godel number e can be found; or in Corollary 
Theorem IX that a finite sequence of applications of (I) — (VI) can be 
found). 

In other words, we should not claim that a function is effectively 
calculable oq the ground that it has been shown to be general recursive, 
unless the demonstration that it is general recursive is effective (cf. 
Church 1936 Footnote 10 ). 

We now summarize the evidence for Church’s thesis (and Thesis I*, 
end § 61) under three main headings (A) — (C), and one other (D) which 
might be included under (A). Some of this evidence will be given in more 
detail in later sections. 

(A) Heuristic evidence. 

(Al) Every particular effectively calculable function, and every 
operation for defining a function effectively from other functions, for 
which the question has been investigated, has proved to be general re¬ 
cursive. A great variety of effectively calculable functions, of classes of 
effectively calculable functions, and of operations for defining functions 
effectively from other functions, selected with the intention of exhausting 
known types, have been investigated. 

(A 2 ) The methods for showing effectively calculable functions 
to be general recursive have been developed to a degree which virtually 
excludes doubt that one could describe an effective process for de¬ 
termining the values of a function which could not be transformed by 
these methods into a general recursive definition of the function. 

(A3) The exploration of various methods which might be expected 
to lead to a function outside the class of the general recursive functions 
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has in every case shown either that the method does not actually lead 
outside or that the new function obtained cannot be considered as ef¬ 
fectively defined, i.e. its definition provides no effective process of cal¬ 
culation. In particular, the latter is the case for the Cantor diagonal 
method. (An illustration of the former will be given in Example 1 § 65 .) 

(B) Equivalence of diverse formulations. 

(Bl) Several other characterizations of a class of effectively cal¬ 
culable functions with the same heuristic property ((A)) exist. These 
have turned out to be equivalent to general recursiveness, i.e. the classes 
of functions which they describe are coextensive. 

In fact three notions arose independently and almost simultaneously, 
namely general recursiveness, X- definability (successive steps toward 
which were taken by Church 1933 and Kleene 1935; cf. Church 1941) 
and computability (Turing 1936-7, Post 1936). The equivalence (i.e. co¬ 
extensiveness) of the X-definable functions with the general recursive 
functions was proved by Church 1936 and Kleene 1936a (also cf. the ref¬ 
erence to work of Rosser in Church 1936 Footnote 16 ). The equivalence 
of the computable to the X-definable functions (and hence to the general 
recursive functions) was proved by Turing 1937. 

The notion of a function reckonable (§ 59 ) in a certain formal system 
Si described (very briefly) in Godel 1936 is a fourth equivalent of general 
recursiveness, under the hypothesis that S x is simply consistent (as 
Rosser remarked in a review 1936a). 

Still another approach is given by Post (1943, 1946) in terms of what 
he calls canonical and normal systems. What this gives directly, as 
it is presented, is an equivalent of recursive enumerability, but then as 
in Example 6 § 60 we obtain an equivalent of recursiveness. 

The fact that several notions which differ widely lead to the same class 
of functions is a strong indication that this class is fundamental. 

(B2) Of less weight, but deserving mention, is the circumstance 
that several formulations of the main notions are equivalent; i.e. the 
notions possess a sort of “stability”. 

Thus, for general recursiveness, the formalism may be chosen in 
several ways (§ 55 ). Also one may give a formulation ( ^-recursiveness) 
not based on any formalism but using instead Schemata (I) — (VI) 
(Theorem III § 57 and Corollary Theorem IX § 58 ), or one using Schemata 
(III), (IV) and (VI) with x-\-y, x-y and 8 * (— 1 if x — y, — 0 if x =£y) as 
initial functions (Kleene 1936b). (Also cf. Julia Robinson 1950.) 

The notion of X-definability has the variants X-K-definability (studied 
by Rosser, cf. Kleene 1936a Footnote 12 ) and X- 8 -definability (Church 
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1935). Also there is a parallel development, started by Schonfinkel 
1924 and Curry (1929,1930,1932) and continued by Rosser (1935,1942a*) 
(also cf. Curry 1948-9), which leads to a notion that we may call com¬ 
binatory definability, proved equivalent to X-definability by Rosser. 

The details of the definition of computability can also be varied, 
as we shall see later (Chapter XIII). 

The system S l of Godel 1936 is the first system in a hierarchy of systems 
Si (t — 1 , 2 , 3 , ...) using successively higher types of variables (cf. § 12 ). 
Godel remarks, “It can be shown moreover, that a function which is 
reckonable in one of the systems S f , or even in a system of transfinite 
order, is reckonable already in S v so that the concept ‘reckonable’ is in 
a certain sense ‘absolute’, while almost all hitherto known metamathe- 
matical concepts (e.g. provable, definable, etc.) depend very essentially 
on the system which is taken as the basis.” Exactly the same functions 
are reckonable in our number-theoretic system of Chapter IV or Rob¬ 
inson’s described in Lemma 18 b §49 (by Theorem 32 §59 and the 
equivalence of reckonability in Godel’s S x to general recursiveness). The 
equivalence of reckonability in these systems is under the hypothesis of 
simple consistency for Godel’s systems and our system of Chapter IV. 
(The simple consistency of Robinson’s system will be proved as Theorem 
53 (a) § 79 .) Two other systems (Z°) and (Z 00 ) having the same class of 
reckonable formulas are given in Hilbert-Bernays 1939 Supplement II; 
these are formalizations of (^.-recursiveness ((Z 00 ) utilizing also the normal 
form). (Mostowski 1947 bases his version of Theorem V § 57 on the notion 
of resolvability of a predicate P in a system S (§ 59 ), with 5 subject 
only to some quite general conditions. As we shall see below in connection 
with (Dl) and Thesis II, only a general recursive function can be reckon¬ 
able in an S which is a formal system with effective rules; but Mostowski 
considers also non-constructive generalizations of formal systems in 
our sense.) 

(C) Turing’s concept of a computing machine. 

Turing’s computable functions (1936-7) are those which can be computed 
by a machine of a kind which is designed, according to his analysis, 
to reproduce all the sorts of operations which a human computer could 
perform, working according to preassigned instructions. Turing’s notion 
is thus the result of a direct attempt to formulate mathematically the 
notion of effective calculability, while the other notions arose differently 
and were afterwards identified with effective calculability. Turing’s 
formulation hence constitutes an independent statement of Church’s 
thesis (in equivalent terms). Post 1936 gave a similar formulation. 
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The work referred to under (A) (especially (Al)) was not all carried 
out originally for general recursiveness or the special notions of 
recursiveness subsumed under general recursiveness (§ 55 ), but much of 
it was done for X-definability (in Kleene 1935) or computability (in Turing 
1:936-7). But by (B) the heuristic and other evidence accumulated in study¬ 
ing the various notions all applies to any one. The accumulation of 
methods shown to be general recursive under (Al) contributes to (A 2 ). 

The case under (A2) will be presented in this chapter in connection 
with the theory of partial recursive functions (cf. § 66). We shall take 
up computability in the next chapter, proving the equivalence of com¬ 
putability to general recursiveness in §§ 68, 69 (cf. (Bl)) and incidentally 
the equivalence of some differing formulations of computability (cf. (B 2 )), 
and giving the evidence under (C) in § 70 . 

(D) Symbolic logics and symbolic algorithms. 

Church 1936 gave the following arguments (in substance), as showing 
“that no more general definition of effective calculability than that 
proposed above can be obtained by either of two methods which naturally 
suggest themselves” (p. 358 ). 

(Dl) Suppose that we are dealing with a function <p(.r) and a formal 
system such that the following is true. The set of the axioms is finite 
or (if infinite) effectively enumerable, and likewise the set of the rules of 
inference; and each rule of inference is an effectively performable oper¬ 
ation. We can effectively recognize a formula P(x, w) which attributes 
a number w as value to 9 for a given argument x, and effectively read 
from it this number. The formulas P(x, w) attributing the correct 
and only the correct values to 9 are provable in the system; i.e. 9 is 
‘reckonable’ § 59 (except that here we are not insisting that P(x, w) 
come from some P(x, w) by substituting x, w for x, w). If the interpre¬ 
tation is allowed that the effectiveness of the metamathematical functions 
and predicate just mentioned implies that the number-theoretic functions 
and predicate corresponding to them under a suitable Godel numbering 
are general recursive, then 9 is general recursive. For by reasoning 
as in the proof of Theorem IX § 58 or (c) § 59 , for some general recursive 
^ and R, ( x)(Ey)R(x, y) and y(x) = ^( \xyR{x , y)); whereupon Theorem III 
§ 57 applies. 

(D2) Consider a symbolic algorithm for the calculation of the values 
of a function <p(x), which shall consist in a method by which, given any 
x, a finite sequence E*o, E*i, ..., E xr% of expressions (in some notation) 
can be obtained, in the following fashion. Given x, the first expression 
E z0 can be effectively found. Given x and the expressions E xi for i < j, 
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it can be effectively recognized whether the algorithm has terminated 
(i.e. whether j — r x ), and if so, the value cp(x ) can be effectively found; 
while in the contrary case, the next expression E I ; . +1 can be effectively 
found. Again, if the effective functions and predicate described become 
general recursive under some Godel numbering, then cp is general 
recursive. For we can reason as in (Dl), regarding (x, EJ now as analogous 
to an axiom, and the operation of passing from (x, E w , ..., E x >) to 
(x, E m , .... E xj> E x) . +1 ) as analogous to a rule of inference. 

In brief, (Dl) and (D2) show that if the individual operations or rules 
of a formal system or symbolic algorithm used to define a function are 
general recursive, then the whole is general recursive. So we could in¬ 
clude (Dl) and (D2) as particular examples of operations or methods of 
definition under (Al). 

Note that (Dl) and (D 2 ) refer to formal systems and symbolic al¬ 
gorithms having a special kind of structure, exemplified by particular 
formal systems and algorithms we know. We have elsewhere (§§ 30 , 60 , 61 ) 
used “algorithm” more broadly to mean any calculation (or decision) 
procedure; and we generalized the notion of a formal system likewise in 
connection with Thesis II and Theorem XIII (§ 60 ). There is of course 
no circularity in adding the evidence provided by algorithms and formal 
systems of the special sorts to the case for Church’s thesis, and afterwards 
applying the thesis (as in §§ 60 , 61 ) to the discussion of algorithms and 
formal systems in the broader sense. 

If we consider only systems satisfying Thesis II (for »+1 variables), 
the functions (of n variables) which are reckonable in various formal 
systems (i.e. each one in some system) are all general recursive, and 
hence all are reckonable in one system (e.g. any one of the systems 
mentioned under (B2)). 

§ 63. Partial recursive functions. As at the beginning of § 62 , let 
R(x, y) be an effectively decidable predicate. Consider the procedure which 
consists, for a given *, in deciding as to the truth or falsity of each of the 
propositions R(x, 0 ), R(x, 1), R(x, 2), .... successively, until one is 
found to be true, and taking the second argument y of that one R(x, y). 
This procedure leads to a natural number y in a finite number of steps, 
if (Ey)R(x, y) and only then. Therefore it can be considered as an al¬ 
gorithm for calculating a mathematical function of x defined over the 
subset x(Ey)R(x, y) of the natural numbers. The function calculated is 
‘the least y such that R(x, y)' or in symbols '\i.yR(x, y)’. 

It may be impossible to extend the definition of this function |xy/?(.v, y) 
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to all natural numbers, in such a way that there will be an algorithm for 
calculating the resulting completely defined number-theoretic function. 
We noted in § 62 that the particular extension zyR{x, y) is effectively 
calculable, if and only if the predicate ( Ey)R(x, y) is effectively decidable; 
and the method shows that no extension of [iyR(x, y) to all natural num¬ 
bers is effectively calculable, unless (Ey)R(x, y) is effectively decidable. 

This can be stated in terms of the theory of general recursive functions. 
Church 1936 called a function (f(x 1 , .... x n ) defined over a subset of the 
M-tuples of natural numbers potentially recursive, if there exists a general 
recursive function <?'{x v ..., x n ) such that <p'(x v ...,*,) = y(x v ..., x n ) 
for each n-tuple x v ...,*„ for which <p(x v ...,x n ) is defined. Now 
if R(x, y) is general recursive, then \iyR(x, y) is potentially recursive if 
and only if ( Ey)R(x, y) is general recursive. For 

( 59 ) eyR(x, y) = \iw[R{x, w) V {(Ey)R(x, y) & w=0}] ; 

and therefore by #D § 45 and Theorem III § 57 , if (Ey)R{x, y) is gener¬ 
al recursive, then syR{x, y) is a general recursive extension <p'(x) of 
fj .yR(x, y). Conversely, if y.yR(x, y) is potentially recursive with y'(x) as a 
general recursive extension, then (Ey)R(x, y) = R(x, <p'(x)), and therefore 
(Ey)R(x, y) is general recursive. 

Example 1. Hence by Theorem V ( 15 ) § 57 , \xyT x (x, x, y) is not 
potentially recursive (Kleene 1938, 1943*), and s yT x {x, x, y) is not general 
recursive (Kleene 1936). 

An algorithm for calculating a function cp may, for a given x, fail to 
lead to a number as value of y{x) either by not terminating (so that no 
matter how many steps have already been performed, the rules of the 
algorithm call for a next step), or by terminating but without giving a 
number as value. We can modify any given algorithm so that whenever, 
for a given x, the given algorithm terminates without producing a number 
as value, the new algorithm gives 0 as value. The new algorithm calculates 
an extension 9' of 9 defined exactly when the original (and the new) 
algorithm terminate. 

Example 1 (continued). Hence any algorithm which will lead to 
the number \iyT x (x, x, y) for every x such that {Ey)T x {x, x, y) cannot 
terminate for every x (using Thesis I § 60 ). 

If there is an algorithm for deciding, given x, whether the function 
<p(x) calculated by a given algorithm is defined or not, then a new al¬ 
gorithm can be set up which calculates an extension <p'{x) of y(x) to all 
natural numbers. (Also cf. Example 5 § 64 .) 
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Example 1 (continued). Hence there is no algorithm for deciding, 
given x, whether [LyT^x, x, y) is defined or not, as we can also see directly 
from its condition of definition (Ey^T^x, x, y) (cf. Theorem XII § 60 ). 

We can also modify a given algorithm without extending the function 
9 {x) so that, for a given x, the algorithm will always fail to terminate 
whenever y(x) is undefined (as is already the case for the algorithm for 
\iyR{x, y) described above). To do so, we arrange that, when the given 
algorithm terminates without giving a number as value of y{x), the new 
algorithm calls for additional steps which will continue ad infinitum. In 
discussing algorithms in the rest of this chapter, we shall often tacitly 
assume that the algorithms are of this kind. 

Suppose that %(x) is defined over all natural numbers, and 'y(v) over a 
proper subset of them containing all the numbers taken as values by y[x ), and 
that both functions are effectively calculable. Then ^(x(v)) completely de¬ 
fined and effectively calculable. The function ty(x) might be [lyT^x, x, y). 
Thus an effectively calculable function restricted to a proper subset of the 
natural numbers may be useful in constructing another effectively calcula¬ 
ble function defined over all natural numbers. 

Also there are problems in foundations which call for a function, needed 
only on a proper subset of the natural numbers, to be effectively cal¬ 
culable. This occurs in the theory of constructive ordinals (Church- 
Kleene 1936, Kleene 1938, Church 1938), and in studies of the intu- 
itionistic logic (cf. § 82 ). 

These considerations indicate the desirability of including partially 
defined functions under our treatment of effective calculability. We 
shall accordingly extend the class of the general recursive functions to 
take in certain incompletely defined functions, calling the resulting class 
of functions the partial recursive functions’. The technical advantages 
of thus extending the class of the general recursive functions, even if our 
purpose were only to support Church’s thesis for the case of completely 
defined functions (Theses I and I*), will become fully apparent in §§ 65 
and 66. At the end of the present section, we use the partial recursive 
functions in stating Church’s thesis for the case of partially defined 
functions (Theses If and I*t). 

To fix our terminology, let us now call a function from any subset 
(proper or improper) of the ^-tuples of the natural numbers to the natural 
numbers a partial function. In other words, a partial function 9 is a func¬ 
tion which for each w-tuple x v ..., x n of natural numbers as arguments 
takes at most one natural number <f>{x v ..., x n ) as value. For an M-tuple 
x v .... x„ for which 9 has a natural number as value, we say 9 (or 
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9 (% 1 , .. x n )) is defined ; for an n-tuple x x , ..., x n for which 9 has no 
natural number as value, we say cp (or y{x x , .... x n )) is undefined (some¬ 
times written “u”). The range of definition of a partial function is the set 
of the «-tuples x v ..., x n for which <p(x v ..x n ) is defined. When this 
consists of all n-tuples, we have an ordinary (completely defined) number- 
theoretic function; otherwise an incompletely defined function. When it 
is empty, we have the completely undefined function. 

To obtain the definition of ‘partial recursive function’ (Kleene 1938 ), 
we adapt the Herbrand-Godel definition of ‘general recursive function’ 
to partial functions, as follows. 

For the case that 4h. . ..,4h are partial functions (of m v 
variables, respectively) we now understand naturally that (cf. 

§ 54) is the set of the equations g^yq, ..., yu)=y where 4'y(y 1 , ..., y mj ) 
= y, for all j = 1 , ..., l and in the case of each j for those w r tuples 
y v ..., y mj for which 4'i is defined. Then for a partial function 9 the 
definition of E defines 9 recursively in (or from) 4 ^, ..., t|/, given in § 54 
reads correctly, if (for emphasis) we now replace “if and only if 
(f>(x lt .,., x n ) — x” by “if and only if 9 (* 1 , ...,*„) is defined and 
9 (# 1 , ..., x n ) = x”. The second phrasing of the definition there (with 
a completeness and a consistency property) can be used, if we now under¬ 
stand Ej? for a partial function 9 in like sense to that just explained for 
Egj.i.g*- Finally we say (corresponding to the definition in § 55) that 
a partial function 9 is partial recursive in 4 i, ..., <J> 1( if there is a system 
E of equations which defines 9 recursively from 4»i, .... 4 ' 1 - 

In the case of a scheme 9 = F(^ 1( ..., 4*i) where 4'i, • • •» 4'i range over 
partial functions (subject to any stated restrictions), we say that F is 
partial recursive, or that 9 is partial recursive uniformly in 4 ^, .... 4 'i. if 
(for fixed n, l, m v ..., mf) there is such an E independent of 4' i . .... 4'i- 
As before, we omit the word “uniformly” except for emphasis. 

We now have, for the case the 9 is not previously known (as at the 
end of § 55): A system E of equations defines recursively a partial re¬ 
cursive function of n variables from partial functions 4 'i> • • •. 4 'i. if f° r 
each M-tuple x v ..x n of natural numbers there is at most one numeral 
x such that E^^j, E b f(x x , ..., x B )=x (where f, g x , ...,g, are 
as before). Here no completeness property is required. The function 
which is defined recursively by E is the function 9 such that ..., x n ) 
is defined for a given w-tuple x v ..., x n , if and ©nly if there is an x for this 
x v .... x n , in which case y(x v ..., x n ) = x where x is the number for 
which the x is the numeral. 
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The partial recursive functions include the general recursive functions 
as those for which the range of definition consists of all the n-tuples 
x v ...,%„ of natural numbers. 

When R(x v ..x n ,y) is a general recursive predicate, the partial 
function [iyR(x v ..x n ,y) (defined if and only if (Ey)R(x 1 , y), 

in which case its value is the least y such that R{x v ..x n ,y)) is partial 
recursive. We have already shown this (without the present terminology) 
as (iv) in the first part of the proof of Theorem IV § 57 . (Then 
y.yR(x lt ..x n ,y) is general recursive exactly when (lb) § 57 holds.) 

Example 1 (concluded). p.y 7 ’ 1 (*, x, y) is partial recursive. 

For partial functions Xi> • • •. X«» 'l'. we tak e •••.*«)>•• •» 

Xm( x i> .... x n )) to be defined when and only when Xi(*i> • • •> x n), • • •, 
Xm[ x v ■ ■ ■, x n ) are all defined and their values constitute an m -tuple 
for which ^ is defined, except when we have otherwise specified. We call 
this the weak sense of <Mxi(*i> • ••.*«)» • ••>Xm(*i» • ••,*«))• This con¬ 
vention shall apply likewise to definitions by substitution not in this 
standard form (cf. § 44 ). The ambiguity which the convention removes 
arises when tp is a constant function, or becomes such in one variable 
for some substitution for the other variables. For example, shall 0 • x{x) 
have the value 0 or be undefined, for an x which makes xi x ) undefined ? 
According to our convention, it shall be undefined. 

We also employ partial predicates with the same convention. For 
example, by substituting partial functions <];(*i, ..., %„) and x{ x i> • ••>*«) 
into the completely defined predicate y t —y 2 , we obtain a partial pred¬ 
icate ty{x v ..., x n )=x(x 1 , ..., x n ). This predicate, for given x v ..., x n , 
is defined if and only if (j/ and x are both defined, in which case it takes 
a true proposition as value if ^ and x have the same value, and a false 
proposition as value if ^ and x have different values. 

Similarly, by substituting partial predicates Q(x v ..., x n ) and 
R(x lt ... ,x n ) into the truth-value function Y x s= Y 2 (‘equivalence’, §45), 
we obtain a partial predicate Q(x v ..., x n ) = R{x v ..., x n ), defined if 
and only if Q(x lt ...,%„) and R{x lt ...,%„) are both defined, in which 
case it asserts the equivalence of those two propositions, being true or 
false according as the two are equivalent or not. 

We now introduce “i>(x v ..., x n ) ~ xi x i, ■ ■ •» x n )” to express, for 
particular x v .... x n , that if either of ^{x lt ...,*„) and x( x i> ■ • x n) 
is defined, so is the other and the values are the same (and hence if 
either of ^{x lt ..., x n ) and x( x v •••>*«) is undefined, so is the other). 
The difference in the meaning of (i) “^(^i. • • •, x n ) = x( x i> • • •> x *)” an( t 
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(ii) "^{x x , — x( x i> • x n)" comes when one of x n ) and 

y{x x , ..., x n ) is undefined. Then (i) is undefined, while (ii) is true or false 
according as the other is or is not undefined. We distinguish = and ~ as 
weak and complete equality, respectively. Our use of ~ constitutes an 
exception to the convention stated above. 

Similarly “Q(x v = R{x x , ...,*»)” shall express, for particular 

x x , ..., x n , that if either of Q(x x , ..x n ) and R(x lt ..., x n ) is defined, 
so is the other and the two values are equivalent propositions (and hence 
if either is undefined, so is the other). We distinguish == and ~ as weak 
and complete equivalence. 

Now (x x ) ... (* n )[<M*i, ..., x„) ~ x(x x , .... *„)], or <\>(x x , ...,x n )~ 
x(x x , ..., x n ) when x x , ..., x n have the generality interpretation, 
expresses that ^ and x are equal as functions, i.e. they have the same 
range of definition, and over this common range they agree in value. 
Similarly (x x ) ... (x n )[Q{x x , ...,*„) ^ R(x x , .... *„)], or Q(x x , ...,x n ) 
^ R(x x , ..., x n ) under the generality interpretation of x x , .... x n , ex¬ 
presses that Q and R are equal as predicates. 

We say that a partial function <p(x x ,..., x n ) is the representing function of 
a predicate P(x x , ...,*„), if <p{x x ,... ,x n ) takes only 0 and 1 as values, and 

P{x i, • • •, x n ) *?(x x , ..tf n )=0, 

or in other words, if according as the value of P(x x , ..., x n ) is t, f or u, 
that of <p(x x , ..x„) is 0, 1 or u. 

We say that a partial function cp or partial predicate P is partial 
recursive in partial predicates and functions T, if the corresponding 
statement holds replacing the predicates among P, T by their representing 
functions. 

The role oi the two equality predicates = and ~ will be different. 
The weak equality = will serve as an operation in building partial 
recursive predicates. We shall see in a moment (Theorem XVII ## 14 , 
Cf) that ^(*1. ..., x n )—x(x x , ...,x n ) is partial recursive in ^ and x- 
The complete equality ~ will be used in expressing our theory about 
partial recursive functions. The predicate ty(x x , ..., x n ) ~ x(x x , ..., x n ) 
is not always partial recursive when ^ and x are partial recursive (cf. 
Example 7 § 64 ). 

Similar remarks apply to the two equivalences = and ^ (cf. Theorem 
XVII #L)t'and Example 8 § 64 ). 

The particular functions and predicates of ## 1 —21 (§§ 44 , 45 ), 
being primitive recursive, are general recursive (Theorem II § 55 ) and 
therefore partial recursive. Rewriting Schemata (IV) and (V) (§ 43 ) with 



§63 


PARTIAL RECURSIVE FUNCTIONS 


329 


in place of and using our convention to read the expressions 

on the right in the weak sense, the notion '9 is primitive recursive in W 
takes on a meaning for the case that the assumed functions and predicates 
are partially defined. For example, in a primitive recursion (Va), for a 
given y, y(y') is defined, if and only if y(y) is defined and x(y, z) is defined 
when 2 is the value of ^(y) (hence by induction, only if all of 9(0), 9(1),..., 
9 (y) are defined). The notion extends from functions to predicates 
via representing functions. Now the former proofs of ##A—G (§§ 44 — 
47 ) apply, provided we understand the resulting functions and predicates 
in the suitable weak senses, e.g. 2 ^(y) is defined when and only when all 

y<z 

of 4 >( 0 ), ...,<[>(* — 1 ) are defined, Q V R when and only when both 
Q and R are defined, ( Ey) y<s R(y) when and only when R( 0), ..., R(z — 1 ) 
are, etc. In each case the appropriate weak sense is easily inferred from 
the proof. But also, with these senses for (IV) and (V), the proof of 
Theorem II (§§ 54 , 55 ) carries over. Hence: 

Theorem XVII. (a) Any function 9 definable from partial functions 
Y by a succession of applications of partial recursive schemes is partial 
recursive in T\ Schemata (I) — (V) are partial recursive, (b) The functions 
and predicates of ##1 —21 are partial recursive ; and — G hold 

reading “partial recursive” for “primitive recursive” and using the weak 
senses of the resulting functions and predicates (call them then # #A t —G*). 

We used the [x-operator above to form \xyR(x v ..., x n , y) as a partial 
function from a completely defined predicate R(x v ..x n , y). Now 
when R(x v ..., x n , y) is any partial predicate, we take y.yR(x v ..., x n , y) 
to be defined when and only when there is a y such that R(x v ..x n ,y) 
is true and R(x v ..., x n , 0 ), .... R(x v .. x n , y— 1) are all defined, in 
which case its value is the least such y. For example, if R( 0 ) = i?(l) = f, 
R( 2 )^t, then \>.yR{y)~ 2 t but if R( 0 ) ^ f, i?(l)=u, R{ 2)^t, then 
l iyR{y) u. 

Now we can consider (VI) § 57 reading “cx \iy” for “= |xy” as a schema 
for any given partial function xi x i> • ■ x n>y) or partial predicate 
R(x v y). Let the equations E be set up as in the proof of Theorem 

III. If pyR(x v ...> y) is defined, then E^, E |- f(x 1 , ..x n )=x 
when x — \xyR(x v ..., x n , y) and for no other numeral x, just as before 
((iii) § 57 ). Conversely, if E^, E |- f(x 1 , ..., x„)=x where x is a numeral, 
then the present conditions for \LyR{x 1 , ..., x n , y) to be defined are met 
(noting that for II xi x v • ■ x n> s ) is defined only if x( x i> ■■ - ,x n ,s) 

s<y 

for s = 0 , 1, ..., y—1 are all defined). Thus: 
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Theorem XVIII (= Theorem III + ). Schema (VI) is partial recursive. 
Hence by Theorem XVII: Every function 9 definable from partial functions 
and predicates Y by applications of (I) — (VI) is partial recursive in Y. 

Now let us see what happens to the proof of the normal form theorem 
(Theorem IX § 58) when we take <p(x x , to be a partial recursive 

function. Let E define 9 recursively. Now (24) and (26) do not necessarily 
hold, but instead, omitting the universal quantifiers, we obtain (from 
(26)) (Ey)S n (e, x v ..., x n , y) as the condition on x 1 ,...,x„ that 
f(x v ..., x n ) be defined. Also (25) and hence (27) hold with in place of 
“ = and (28) holds, not with the generality interpretation, but for each 
«-tuple x x , ...,x n for which <?{x v ...,x n ) is defined. But under our 
meaning of the (^.-operator applied to a completely defined predicate, 
U{[iyS„{e, x x , .... x n , y)) is defined exactly when ( Ey)S n {e, x x , ...,x n , y). 
Hence both members of (28) have the same range of definition, so (28) 
with “=” replaced by holds for all x v ..., x n . The proof given for 
Theorem IX* (under Theorem X) goes through with the same modifi¬ 
cations, provided the assumed functions t}^, ..., <J/j are completely defined 
(otherwise e.g. (Eu x ) Ui<y [Nu((y) oxx , u x ) & Nu((y) 0>2 , 4>K))] in 0Dfl2*, 
likewise $(y), might be undefined in some case in which for the proof 
we would need it to be defined). Thus: 

Theorem XIX. (a) (= Theorem IX*). Given any partial recursive 
function <f>(x x , ..., x n ), a number e can be found such that 

(60) ?(%, ...,*„) ~ 17 ( y-yT n (e, x v .... x n , y)) 

(so that (. Ey)T n (e, x x , ..., x n , y) is the condition of definition of the function 
<p(* 1( ..., *„)), and 

(61a) (x x ) ... (x n )(y) [T n (e, x x , .... x n , y)-+ U(y) ~ cp(x x , ..., x n )]. 

(b) (= Theorem IX*!). Similarly reading “partial recursive in Y", 
“Tin place of “partial recursive”, “T n ”, respectively, where “Y” 
stands for any l (> 0) completely defined functions and predicates of 
m x , ..., m t variables, respectively. 

Extending the definitions given in § 58, we say that any number e 
such that (60) (and hence (61a)) holds defines 9 recursively or is a Godel 
number of 9 ; and similarly in the case of a function 9 partial recursive 
in completely defined functions Y. The notions extend to a predicate P 
with representing function 9 as before. 

By the proof of Theorem XIX, if E is a system of equations defining 
9 recursively (recursively from completely defined Y, with suitable 
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given function letters), and e is the Godel number of E, then e defines 9 
recursively (recursively from Y). 


Example 2 . Let / and g be Godel numbers of partial recursive 
functions ^ and y, respectively. Then the predicate 'ii(x) ~ y{x) is ex¬ 
pressed by 

mm, x, y) -> (Ez)(T,{g, x, z) & U(y)=U(z))} 

& {7\(g, x, y) -> (£*)(7\(/, x, z) & U(y)=U(z))}]. 

Corollary. Every partial recursive function 9 (function 9 partial 
recursive in completely defined functions and predicates Y) is definable 
(definable from Y) by applications of Schemata (I) — (VI). 

The algorithm given by ( 60 ) for calculating a partial recursive function 
9 is of the kind which does not terminate when <p(x v .... x n ) is undefined. 

By the theorem, a partial recursive function <p(x v .. .,x n ) has a range 
of definition of the form x(Ey)R(x v ..., x n , y) where R is primitive re¬ 
cursive, or in other words for n = 1, the range of definition (if it has a 
member) is recursively enumerable (Theorem XIV § 60 ). 


Example 3 . Hence by Theorem V ( 12 ) § 57 , no partial function with 
x(y)T 1 (x, x, y) as its range of definition is partial recursive. It is im¬ 
possible to devise any algorithm which will lead to some natural number 
for exactly those x’s such that (y)T 1 (x, x, y) and no others (by Thesis 
H (a) below). In particular, the function 9 defined thus, 

«.(*)-{ 0 if Wryxx.y). 

I u otherwise, 


is not partial recursive. This function is effectively calculable for x’s in 
its range of definition, and is potentially recursive. 


Now we state Church’s thesis for the case of partial functions, keeping 
in mind Examples 1 and 3 . We call a partial function <p(x v .... x n ) 
potentially partial recursive, if there is a partial recursive function 
9'(x 1 , .. x n ) such that <p'(x v ..., x n ) = <p(x v ..., x n ) on the range 
of definition of 9. The thesis will be stated in two parts, according as we 
require that the effective calculation procedure lead to no function value 
off the range of definition of the function in question, or merely disregard 
what happens off this range. 


Example 4. Show that: If 9(x) is potentially partial recursive and 
has a range of definition of the form x(Ey)R(x, y) where R is general 
recursive, then 9(x) is partial recursive. 
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Thesis I + . (a) The function which any algorithm calculates {the function 

being undefined for each n-tuple of arguments for which the algorithm leads 
to no natural number as value) is partial recursive, (b) Every partial function 
which is effectively calculable {in the sense that there is an algorithm by 
which its value can be calculated for every n-tuple belonging to its range 
of definition) is potentially partial recursive. 

The thesis can also be phrased to apply to predicates. 

For l {> 0) assumed functions and predicates Y, we have a corre¬ 
sponding Thesis I* t , including the case of uniformity. 

Much of our treatment of functions partial recursive in l assumed 
functions and predicates Y will be limited (for l > 0 ) to the case each 
of the functions and predicates Y is completely defined, or is incompletely 
defined but partial recursive or partial recursive in completely defined 
functions. These kinds of incompletely defined functions have been 
introduced to meet the requirements of the theory of algorithms (in¬ 
cluding reductions of decision problems), because it may be impossible 
to complete the definition of such a function and still have an algorithm 
for it. Outside the theory of algorithms, a like reason for not completing 
the definitions of incompletely defined number-theoretic functions is 
not readily apparent (cf. Example 4 § 64 ). 

Example 5 . If ^(x) is a partial function with the range of definition 
x{Ey)R{x, y) (or xR{x)) where R is a completely defined predicate, then 
there is a completely defined function ^ c (x) primitive recursive in <J; ( R 
such that 4 1 is partial recursive in <J» C . For let C = x{Ey)R{x,y); let 6 
enumerate recursively { 0 }-|-C' (cf. Example 6 § 60 ); and let 
<J/°(y) = 2 0(v) • 3 7 ) (0(l ' >l where t)(0) = 0, •/)(%') ~ vj<(x). 

Now <!»(*) ~ (^%y[(^ c (y))o=*']))i- 

Example 6 . W 0 {x, [iy[W 0 {x, y) V W^x, y)]) is a partial, but not 

potentially, recursive predicate (cf. § 61 ). 

§ 64. The 3-valued logic. In this section we shall introduce new 
senses of the propositional connectives, in which, e.g. Q{x) V R{x) will be 
defined in some cases when Q{x) or R{x) is undefined. 

It will be convenient to use truth tables, with three “truth values" 
t (‘true'), f (‘false’) and u (‘undefined’), in describing the senses which 
the connectives shall now have. 

Some remarks are appropriate to justify our use of truth tables here 
from the finitary standpoint, and to explain how we are led to choose the 
particular tables given below. 
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We were justified intuitionistically in using the classical 2-valued 
logic, when we were using the connectives in building primitive and 
general recursive predicates, since there is a decision procedure for each 
general recursive predicate; i.e. the law of the excluded middle is proved 
intuitionistically to apply to general recursive predicates. 

Now if Q(x) is a partial recursive predicate, there is a decision pro¬ 
cedure for Q(x) on its range of definition, so the law of the excluded middle 
or excluded “third” (saying that, for each x, Q(x) is either t or f) applies 
intuitionistically on the range of definition. But there may be no algorithm 
for deciding, given x, whether Q(x) is defined or not (e.g. there is none 
when Q(x) is yyT^x, x, y)= 0 ). Hence it is only classically and not in¬ 
tuitionistically that we have a law of the excluded fourth (saying that, 
for each x, Q(x) is either t, f or u). 

The third “truth value” u is thus not on a par with the other two t and f 
in our theory. Consideration of its status will show that we are limited 
to a special kind of truth table. 

In asserting e.g. that Q(x) V R(x) is primitive or partial recursive (uni¬ 
formly) in Q and R, we assert the existence of an algorithm for obtaining 
the truth value of Q(x) V R(x) from those of Q(x) and R(x). Again, in the 
partial case, u will have a different status from t and f. 

Suppose we are to pick a truth table for Q V R, so that Q(x) V R(x) 
will be partial recursive (uniformly) in Q and R. Let us discuss this heuris- 
tically, for the moment, identifying partial recursiveness with effective 
decidability. We ask to be able to decide by an algorithm, given x, whether 
Q(x) V R(x) is t or f (if it is defined) from information that Q(x) is t or is f 
(if it is defined) and like information about R{x). Information that Q(x) 
is u is not utilizable by the algorithm; u means only the absence of in¬ 
formation that Q(x) is t or is f. If in case Q(x) is u, the algorithm gives e.g. 
t as value to Q(x) V R{x), the decision to do so (for the given x and R(x)) 
must not have depended on information about Q(x) (since none was 
available). In particular, if without changing the value of R(x), that of 
Q{x) were changed to t or f, the same decision would still be made. 

We reach the same conclusion, if we ask instead merely that 
Q{x) V R(x) be partial recursive, whenever Q and R are partial recursive. 
In general, an algorithm for Q{x) V R(x) (ab initio) will have access to 
information about Q(x) and R(x) only by utilizing algorithms for Q(x) 
and R(x) which are incorporated into it. A decision reached in pursuing 
the algorithm for Q(x) V R{x) that e.g. t is the value must have been based 
on information about Q[x) and about R(x) which had been produced at 
some finite stages in pursuing the algorithms for Q{x) and R(x). At any 
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stage in the algorithm for Q{x), we shall either have found out that 
Q(x) is t, or that Q(x) is f, or we shall not have learned the truth value of 
Q{x). If Q(x) is actually u, we cannot learn this by pursuing the algorithm, 
but if at all only in some other way, as by some metatheoretic reasoning 
about the algorithm. For special Q’s we might substitute another al¬ 
gorithm which would also tell us when Q(x) is u, but we cannot do this 
in general. Thus, if when Q(x) is u, Q(x) V R(x) receives the value t, the 
decision must (in the general case) have been made in ignorance about 
Q{x), and in the face of the possibility that, at some stage in the pursuit of 
the algorithm for Q(x) later than the last one examined, Q(x) might be 
found to be t or to be f. 

Proofs in terms of the theory of partial recursive functions confirming 
these heuristic arguments (and extending them to other operators besides 
the propositional connectives) will be given at the end of the section 
(Theorem XXI and Examples 6 — 8). 

We conclude that, in order for the propositional connectives to be 
partial recursive operations (or at least to produce partial recursive pred¬ 
icates when applied to partial recursive predicates), we must choose 
tables for them which are regular, in the following sense: A given column 
(row) contains t in the u row (column), only if the column (row) consists 
entirely of t's; and likewise for f. 

When we extended #D from primitive to partial recursiveness by 
taking over substantially the former proofs (#D t of Theorem XVII § 63 ), 
we were using the propositional connectives in the weak senses, which are 
described by the 3-valued tables (the weak tables) obtained from the 
classical 2-valued tables by supplying u throughout the row and column 
headed by u. These are regular tables (trivially). 

Now we introduce strong senses of the propositional connectives, 
described by the following strong tables. 


\ 

? 

QVR 

R t f u 

Q&R 

R t f u 

Q 

R 

-> R 

t f u 

Q 

R 

= R 
t f u 

Q t 

f 

Q t 

t t t 

Q t 

t f U 

Q t 

t f u 

Q t 

t f u 

f 

t 

f 

t f U 

f 

f f f 

f 

t t t 

f 

f t u 

u 

u 

u 

t u u 

u 

u f u 

u 

t u u 

u 

u u u 


Of these tables, only those for V, & and -> differ from the respective 
weak tables. Henceforth — , V, &, and = applied to partial predicates 
shall be understood in these strong senses, except when otherwise 
stated. 
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Example 1. Now ( 61 a) can be restated thus: 

( 61 b) (^i) ... {%n)(y)\T n (c, x lt .. ., x n , y) —► U(y)=<p(xi, ..., x n )]. 

Statements of the form “If cp^, ...,x n ) is defined, then where 

the conclusion is meaningless when cp(%, ..x n ) is undefined, can be 
understood then as uses of the strong ->. (Such statements occur already 
in § 63 .) 

These strong tables are uniquely determined as the strongest possible 
regular extensions of the classical 2-valued tables, i.e. they are regular, 
and have a t or an f in each position where any regular extension of the 
2-valued tables can have a t or an f (whether t or f being uniquely de¬ 
termined). 

We give the following three tables as examples of irregular tables. 
The present strong 3 -valued logic (Kleene 1938) is not the same as the 
original 3 -valued logic of Lukasiewicz (1920; cf. Lewis and Langford 
1932 pp. 213 ff.), which differs from it by having t instead of u in Row 
3 Column 3 of the tables for -> and = (labeled here as and = L ). 


Q 


L# 

Q 

=. 

lR 

Q 


R 

R 

t 

f 

u 

R 

t 

f u 

R 

t 

f « 

Q t 

t 

f 

u 

Q t 

t 

f U 

Q t 

t 

f f 

f 

t 

t 

t 

f 

f 

t u 

f 

f 

t f 

u 

t 

u 

t 

u 

u 

u t 

u 

f 

f t 


We further conclude from the introductory discussion that, for the 
definitions of partial recursive operations, t, f, it must be susceptible of 
another meaning besides (i) ‘true’, ‘false’, ‘undefined’, namely (ii) ‘true’, 
‘false’, ‘unknown (or value immaterial)’. Here ‘unknown’ is a category into 
which we can regard any proposition as falling, whose value we either do 
not know or choose for the moment to disregard; and it does not then 
exclude the other two possibilities ‘true’ and ‘false’. 

Example 2. Suppose that, for a given x, we know Q(x) to be undefined 
and R(x) to be false. Then using t, f, u as ‘true’, ‘false’, ‘undefined’ ((i)), we 
can conclude by the entry of Row 3 Column 2 in the table for V that 
Q{x) V R(x) is undefined. 

Example 3. Suppose that, for a given x, we know Q{x) to be true. 
Then, using t, f, u as ‘true’, ‘false’, ‘unknown’ ((ii)), we can conclude by 
the entry of Row 1 Column 3 that Q(x) V R(x) is true. To draw this 
conclusion by using the tables with Meaning (i), we would need to use the 
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classical law of the excluded fourth, thus: Either R(x) is t, f or u; and ifi 
each of Row 1 Column 1, Row 1 Column 2 and Row 1 Column 3 a t 
appears. 

From this standpoint, the meaning of Q V R is brought out clearly by 
the statement in words: Q V R is true, if Q is true (here nothing is said 
about R) or if R is true (similarly); false, if Q and R are both false; defined, 
only in these cases (and hence undefined, otherwise). 

The strong 3 -valued logic can be applied to completely defined pred¬ 
icates Q(x) and R{x), from which composite predicates are formed 
using — , V, &, h in the usual 2 -valued meanings, thus, (iii) Suppose 
that there are fixed algorithms which decide the truth or falsity of Q{x) 
and of R{x), each on a subset of the natural numbers (as occurs e.g. after 
completing the definitions of any two partial recursive predicates clas¬ 
sically). Let t, f, u mean ‘decidable by the algorithms (i.e. by use of only 
such information about Q{x) and R{x) as can be obtained by the al¬ 
gorithms) to be true’, ‘decidable by the algorithms to be false’, ‘unde- 
cidable by the algorithms whether true or false’, (iv) Assume a fixed state 
of knowledge about Q(x) and R(x) (as occurs e.g. after pursuing algorithms 
for each of them up to a given stage). Let t, f, u mean ‘known to be true’, 
‘known to be false’, ‘unknown whether true or false’. 

The following three classical equivalences 

( 62 ) Q&R^^VJ, ( 63 ) Q^R^QVR, 

( 64 ) Q^R^ (Q-+R)&(R->Q) 

hold, as the reader may verify by constructing the tables for the right 
members, and comparing them with the given tables for the left members. 
But e.g. Q & (RV R) ^ Q (cf. *52 § 27 ) does not (when Q is t and R is u, 
the left member is u and the right is t). 

The proofs of ( 62 ) — ( 64 ) by use of the 3 -valued tables can be construed 
as showing that if either member is defined the other is and has the same 
value (as ~ asserts), using the law of the excluded third on the ranges 
of definition. This law we have given when Q and R are partial recursive ; 
and the equivalences in the general case are subject to it intuitionistically 
as an hypothesis. Similar remarks apply to ( 65 ). 

Strong senses are given to the bounded quantifiers, thus. For each 
^ >-0, (Ey) y<z R{y) ^ R{ 0) V...Vfi(z-l); and (Ey) y<0 R{y) ^ f. Ex¬ 
pressed in- words, ( Ey) y<z R(y ) is true, if R(y) is true for some y < z; 
false, if R(y) is false for all y < z; defined, only in these cases. 

Similarly, for each z > 0, (y) v<z R(y) = R(0 ) & ... & R(z — 1); and 

(y),< a m st. 
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We then have 

( 65 ) iy) v < z m ~ (Ey) v <My)- 

Similarly, we give strong senses to the unbounded quantifiers, by 
thinking of (Ey)R(y) as the strong disjunction of R(y) for y = 0 , 1 , 2 , .. ., 
and of (y)R(y) as the strong conjunction. 

In words, (Ey)R(y) is true, if R(y) is true for some natural number y; 
false, if R(y) is false for every y; defined, only in these cases. (Then 
(y)R(y) = (Ey)R(y).) Unlike the bounded quantifiers (Theorem XX (b)), 
the unbounded quantifiers are of course not partial recursive operations 
(Theorem V § 57 ). 


Example 4 . Consider the class of the predicates expressible in the 
form ( Ey)R(x , y) with R partial recursive. The definitions of all incom¬ 
pletely defined predicates of this class can be completed without going 
outside the class (in fact, with a primitive recursive R). For if r is a Godel 
number of R(x, y), then (Ey)[T t (r, x, (y) 0 , (y) x ) & U((y) 1 )=0] is com¬ 
pletely defined and = ( Ey)R(x, y) on the range of definition of the latter. 
Similarly for the form (y)R(x, y). (Cf. Kleene 1943 p. 57 Theorem VI.) 


Theorem XX. (a) The (strong) predicate Q(x v ...,x n ) is partial 
recursive in the predicate Q. The ( strong ) predicates 
Q(x v ...,*„) V R(x v ...,x n ), Q(x lt ...,x n )& R(x lt ..., x n ), 

Q(x v R(x lt ...,x n ) and Q(x v ...,x n ) = R(x v ...,x n ) are 

partial recursive in the predicates Q and R. 

(b) The (strong) predicates (Ey) y<z R(x 1> ...,x n , y) and 
(y) v<z R( x i, ..x n ,y) are partial recursive in the predicate R. 

(c) (Strong) definition by cases. The function 9 defined by 


<p(*i, • • x n ) ~ 


•••>*«) if <2i(*i> •••>*«), 

4*m(-^l> • • •» %n) if Qm(%l> • • •> %n)> 


where Q x , ..., Q m are mutually exclusive (under the interpretation that 
y(x v . .., x n ) shall ~ 9 i(x v ..., x n ) if Qi(x lt ..., x n ) is true, disregarding 
<f>j(x v ..., x n ) and Qj(x v ..., x n ) for all j ^ i), is partial recursive in 

• • •> Ql> • • •> Qm- 


Proofs, (a) To treat Q V R for example, let ^ (x x ,..., x„), x(x v ..., x n ) 
and <p(x x , ...,x n ) be the representing functions of Q(x v ...,x„), 
R(x v ..., x n ) and Q(x v ..., x n ) V R(x v ..., x n ), respectively. Consider 
the equations 
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cr(0) = 0 , t(1 , 1) = 1, 

•••»*«) — g(< 1 >(x v 

[ X l> • • •> x n ) == * • •* x nj)> 

9i X V • • *i X n) — T (4 , (^'l* • * •’ X n)> X( X 1> • • •* X n))> 

where a and t are partially defined auxiliary functions. By translating 
these equations (A) into the formal symbolism of recursive functions 
(§ 54), we obtain a system E defining <p recursively from t}>, x- — The 
method illustrated applies to any regular table. (Alternatively, one 
can treat the other connectives thence, noting that Q has already been 
treated in D* of Theorem XVII, since the strong table for Q agrees with 
the weak one, and using ( 62 ) — ( 64 ).) 


(b) By (65) it will suffice to treat (Ey) y<z . Let x(*i* ...,x n ,y) 
and <p(x v ...,x n ,z) be the representing functions of R(x v ..x„,y) 
and {Ey) u<z R(x 1 , y), respectively. Then (using ##6, B f ) the 

equations 

f «r(0) = 0, t(1) = 1, 


(B) 


<?{ x v ...,x n , z') = a(x( x i, z—y)), 

f( x l> ..., x n,z) = x( n x( X l, ---.Xn, y)) 

y<z 


translate into a system E defining 9 recursively from x- 


(c) First method. Like (a); e.g. for m = 2 and n — 1, 
r <Tj(0, x) = <p x (x), (T 2 (0, x) = <p 2 {x), 

' 1 f( x ) = x )> ?(*) = *)■ 


Second method, for Q v ..., Q m simultaneously defined. 

9 ~ (xy[(<? x &y==? x ) v ... V (Q m & y=? m )]. 

Remark 1. For consistency with our usage in (VI') § 57 , and at the 
beginning of § 54 where we first handle “==” quasi-formally before trans¬ 
lating, we write (A) — (C) here with “=”. But considered intuitively, 
as laws obeyed by the partial functions appearing in them, (VT) and 
(A) — (C) should be written with 

Example 5. If 9(x) is partial recursive, and ‘y(x) is defined’ is 
general recursive, then y(x) is potentially recursive. For let 

9'(x) ~ \iy[y—y{x) V (<p(x) is defined & y= 0 )]. 

Let V F be a sequence of partial functions 1^, ..., <|»|. By an extension 
T” of V F we mean a sequence <jq, .. -, tp* °f partial functions which are 
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extensions respectively of ^1. • he. suc h that for i— 1, 

4 »i'(yi, • • • , ym t ) = . • ■, y m ,) on the range of definition of <J/<. Simi¬ 

larly if Y include predicates. 

The reader wishing to advance rapidly to the main results of the chapter 
in § 66 may omit Part (b) of the following theorem. 

Theorem XXI. (a) If 9 ~ F(Y) or <p(x) ~ F(Y; *) is a partial re¬ 
cursive functional, and F(Y X ; x-f) ~ k, where Y x are particular functions 
and x v k are particular natural numbers, then for every extension Y/ of Y x , 
F(Yi'; ~ 

(b) Let a function 9 be defined from a function p by an operation of the 
form <p(x) ~ F(^(%)), where F(a) is a function from {u, 0, 1, 2 , ...} to 
{u, 0 , 1 ,2, ...}. If F(u) ~ k, where k is a natural number, but for some 
natural number m, not F (m) ~ k, then there is a partial recursive function 
4 * (taking only m as values) for which the resulting function 9 is not 
partial recursive. 


We can read “x v ...,x n 


v n> 




in place of “x”, 


X-% 


respectively (where n ;> 0 for (a), > 1 for (b)); and the theorem can be 
stated for predicates, with t and f taking the place of 0, 1,2, ... as defined 
values. 


Proofs, (a) By hypothesis, there is a system E of equations with 
given function letters G and principal function letter f, such that, for 
any natural number x and partial functions Y: E^f, E b f(x) —y 
where y is a numeral, if and only if F(Y;*)=y. But F(Y 1 ;a; 1 ) = k, 
where k is a natural number; so E^ 1 , E (- i(x 1 )=k. Now if Y/ is any 
extension of Y 1( then Ej 1 C Ej 1 '; hence also E^’ 1 , E (- f(x 1 )=fe; 
and therefore F( Y/; x-f) = k. (If the functional F(Y) is defined only under 
some restriction on the range of Y, the theorem applies only to extensions 
Yj” satisfying the restriction.) 

(b) Let ^(*) ~ m-\-0-\xyT x {x, x, y) and p(#) ~ \Ly[y{x)=k & y=0]. 
By Theorem XVIII, tj'(^) is partial recursive, and p(*) is partial recursive 
if y(x) is. We show now that p(x) is not partial recursive. If (Ey)T 1 (x, x, y), 
then pyr x (x, x, y) is undefined, hence <|(*) is undefined, hence by hy¬ 
pothesis cf(x) = k, hence p(x) is defined. Thus (EyjT^x, x, y) -*■ (p(x) is 
defined}. Similarly, (£y) 7 \(£, x, y) (p(.v) is undefined}; or by contra¬ 
position (cf. *13 § 26 ), (p(*) is defined} -> (£y)7\(x, x, y). Thus (p(x) is 
defined} = {EyYT-^x, x, y) = {y)T 1 (x, x, y). By Example 3 § 63 , p(x) is 
therefore not partial recursive. 

In Examples 6 — 8, we give (a) and (b) parts separately, so as to 



340 


PARTIAL RECURSIVE FUNCTIONS 


CH. XII 


illustrate both parts of the theorem, although the conclusion of the (b) 
part implies that of the (a) part. 

Example 6. Can we improve upon Theorem XVIII by strengthening 
[xyR(x, y) to be the least y such that R(x, y) is true irrespective of whether 
R(x, 0 ), .. R(x, y— 1) are all defined (write it then \x'yR(x, y)) ? 
(a) No, since when R(x, 1) is t, then i±'yR(x, y) changes from 1 to 0 when 
R(x, 0 ) is changed from u to t. In more detail: Let y(x, y) be the rep¬ 
resenting function of R{x,y) and let cp(v) ~ \x'yR(x, y) ~ F(x; x). Let 
Xi, x i be choices of y, x such that Xi(*i» 0 ) ~ u, Xi(*i> 1 ) — 0 - Then 
F(xi; x-i) — b Now by (a) of the theorem, if 9 is partial recursive in y, 
then F(x{; x t ) ~ 1 for every extension y x of Xi- But F(xj; x~ 0 for an 
extension y' x such that Xi(^i> 0 ) ~ 0 . Therefore 9 is not partial recursive 
in x- (b) We can even find a particular partial recursive R for which 
[i .'yR(x, y) is not partial recursive. To see this, let R(x, y) be of the form 
y = ^(x) V y= 1 . Then [i'yR(x,y ) changes from 1 to 0 when <\/(x) is 
changed from u to 0 . In detail: Let y(x) ~ \x'yR(x,y) ~ F(^(x)). Then 
F(it) ~ 1, but F(0) ~ 0 . Hence by (b) of the theorem a partial recursive <j/ 
can be chosen so that 9 is not partial recursive. 

Example 7. When y(x) ~ 0, then <J/(x) ~ y(x) changes from f to t 
when 'ji(x) is changed from u to 0. Hence (a) <y(x) ~ y(x) is not partial 
recursive in (J/, y. Also (b) for some partial recursive ^ and x. — x{x) 
is not partial recursive. (First take x to be Cj § 44; then choose a ^ to go 
with this x by (b) of the theorem.) 

Example 8. The table for Q ^ R is irregular; e.g. when R ^ t, 
then Q = R changes from f to t when Q is changed from u to t. Hence 
(a) Q(x) ^ R(x) is not partial recursive in Q and R, and (b) for some 
partial recursive Q and R, Q(x) = R(x) is not partial recursive. — Any 
irregular table can be dealt with similarly. 

§ 65 . Godel numbers. We now abbreviate U{\iyT n {z, x v ..., x n , y)) 
as “<b n {z, x lt ...,x n )” or even as "{^}(%. ...,v n )” or “z(x v 
By Theorem XVIII, is a partial recursive function of n + 1 variables 
(and hence, for each fixed z, <t \(z, x x ,, x n ) is a partial recursive function 
of the -n variables x v ..., x n ). Rewriting Theorem XIX ( 60 ), any partial 
recursive function y{x v ..., x n ) of n variables can be obtained from O n , 
thus, 

(66) <p(x v ..., x n ) ~ x lt ..., x n ) ~ e(x v .... x„), 

where e is any Godel number of 9. Similarly, I 7 ((i.yjJ(z, x v .... x n , y)) 
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will be written (2, x v “{z} W (x v ..., x n )” ov“z' ¥ (x l , ...,x n )” 

for completely defined functions Y, with corresponding remarks. Sum¬ 
marizing : 

Theorem XXII (= Theorems XVIII + XIX). The function 
0„(z, %,..., x n ) is -partialrecursive, and<& n (z, x lt ... ,x n ) for z = 0, 1,2, ... 
is an enumeration (with repetitions) of the partial recursive functions 
of n variables. Similarly, for completely defined Y, Oj is partial recursive in 
Y and enumerates (with repetitions ) the functions of n variables which are 
partial recursive in Y. (Enumeration theorem for partial recursive 
functions.) 

This theorem is only possible because a partial recursive function 
may be undefined for some sets of arguments. 

Let us recapitulate the usual Cantor diagonal argument for completely 
defined functions. Suppose C is a class of such functions of various 
numbers of variables; and that Q>(z,x v ..., x n ) enumerates (with rep¬ 
etitions) the n-variable functions of C (for some fixed n ;> 1). Then 
<t>(x v x v x 2 , ...,#„)+ 1 cannot 8 C, because otherwise we would have 

<*>(?, q,x 2 , ...,*«) +1 = 0(?, q, x 2 , .... *„) 

for some number q, which is impossible. If C is closed under the operation 
of passing from a function 9 (z, x v ...,x n ) to y(x v x v x 2 , ...,*„)+1, 
then 0(2, x v .... x n ) cannot 8 C. In particular, this shows that there is 
no corresponding enumeration theorem for the general recursive functions. 

But with partial functions, we would have instead 

0%, q, x 2 , ..., *„)+l ~ ®(q, q, x 2 , . ...x n ), 

which is not impossible, but simply means that 0(^, q,x 2 , ..., x n ) must 
be undefined. In particular, we thus prove that the partial recursive 
function 0„(2, x v .... x n ) is undefined, when 2 = x x = q where q is any 
Godel number of $„(%, x v x 2 , ..., x n ) +1. 

The Richard paradox (§11) arose by attempting to maintain simul¬ 
taneously that 0(2, x v ..., x n ) is in the class and is completely defined. 
(In § 11, n = 1.) 

Diagonal reasoning with partial functions will be exploited further 
in establishing Theorem XXVII § 66. 

Example 1. Let rj(z v ..., z r ) be a given partial recursive function. 
For each z v ..., z T , let (p Zl ,..., Zr (x lt ..., x n ) be the partial recursive 
function which is defined for an «-tuple x v ..., x n only if y)(2 1; ..., z r ) 
is defined, and of which in this case rj(z v ...,z r ) is a Godel number. 
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Then <p Zl ,...,z r (x v ..x„) considered as a function cp(z x , . .. , z r , x v . .., x n ) 
of all r-\-n variables is partial recursive. For 

( 67 ) cp {z v ...,z r ,x v ...,*„) ~ O n (7)(z 1 , .. .,z r ), x v . ..,x n ). 

(This is an example for the first alternative under (A 3 ) § 62 .) 


Theorem XXIII. For each m, n >_ 0, there is a primitive recursive 
junction S™(z, y v ..., y m ) (defined below) such that, if e defines recursively 
<p(y 1( .. y m , x v ..., x n ) as a function of the m-\-n variables y v ..., y m , 
x v ..., x n> then for each fixed m-tuple y v ..., y m of natural numbers, 
S%(e, y v ..., y m ) defines recursively <p(y v ...,y m ,x v ..., x n ) as a function 
of the remaining n variables x v ..., x n . Stated in the X-notation (§ 10 ): 
If e defines Xy x ... y m x 1 ... x n cp(y x , ...,y m ,x v ..., x n ) recursively, then 
Sn(e, y v defines \x x ... x n cp(y 1 , ...,y m ,x v .. .,x n ) recursively. 

Similarly reading “primitive recursive function S 1 ff’ mi ’" 4 ’ m >(z,y 1 , .. .,y m )”, 
“defines recursively from T” in place of “primitive recursive function 
^n( z > Vi’ • • •’ ym)”, “defines recursively’’, respectively, where Y are l 
completely defined functions and predicates of m x , ...,mi variables, 
respectively. 


Proof (for / = 0 ). Let S®(z) = z. For m > 0 , choose numbers y v ..., y m . 
With these fixed, let <p(y x , ..., y m , x v ..., x n ) be written “y(x v .... x „)” 
Now 

(68) cp(* x , ...,x n ) f=!<I) 

m+n {e,y v ...,y m , X V ...,x n ). 

Let D be a system of equations defining O m+n recursively, with g as 
principal function letter, and not containing f. Let C consist of the equa¬ 
tions of D followed by the equation 

f(a x , .. ■, a n )=g(e, y x , ...» ym> &i> • • • > &«) 


which we obtain by translating (68). This system C defines y(x x , 
recursively. Let d,f,g,a v be the Godel numbers of D, f, g, 

a v .... a n , respectively. Let 5”*(z, y v .... y m ) = 

d *[2 exp 2 15 •3 2/ * p I , *•••* p » , •5 2ff " 3iVu< * ),,> 2 rw(!,,) ''“ ,, m+? m> ' p OT+2'“ / PTO+»+i] 


(cf. § 56 , #21 § 45 , Example 2 § 52 ). Then S™(z, y v ..., y m ) is primitive 
recursive as a function of z, y x , ..., y m ; and for the fixed y v ..., y m , 
S'ff(e, y v ... ;y n ) is the Godel number of C, and hence defines <p(x v ..., x n ) 
recursively. 


Example 1 (concluded). We do not get a larger class of functions 
9 (z x , ..., z r , x 1( ..., x n ) by starting with a partial recursive 7) (as above) 
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than with a primitive recursive y). For let the <?{z lt ..z r , x, ...,*„) of 
( 67 ) have e as Godel number. Then S r n {e, z v ..., z r ) is a Godel number 
of 9z 1 ,...,* f (* 1 , ..., *„), i.e. S r n (e, z v .... z r ) is a primitive recursive 
z r )” for the same <p Zl . Zf (x v . ..,x n ). 

Example 2 . For any general recursive predicate R(x v ..x„, y) 
(with n > 1), let e be a Godel number of the partial recursive function 
lx x zx 2 ... x n \xyR{x 1 , ...,x„,y). For any fixed x v then S*(e, x x ) is a 
Godel number of Xzx 2 ... x n ^.yR{x 1 , .... x n ,y). This function is defined 
for a given «-tuple z, x 2 , ..., x n , if and only if (Ey)R(x v y). 

Hence by Theorem XIX, 

( 69 ) (Ey)R{x lt y) = {Ey)T „(S^(e, xj, z, x 2 , .... y). 

Substituting S\(e, x x ) for z, 

( 70 ) (Ey)R(x v ...,x n ,y) m (. Ey)T n (S l n (e , * x ), S^(e, xj, x 2 , .... x n , y). 

Taking n = 1 and writing “a” for “x 1 ” and “x” for “y”, this shows that 
(Ex)R(a, x) is primitive recursive in {Ex)T x (a, a, x). Thus the decision 
problem for (Ex)T x {a, a, x) is of highest degree of unsolvability for pred¬ 
icates of the form ( Ex)R(a , x) with general recursive R (cf. end § 61 ). 
Furthermore, by the definition of S(, a 1 ^a 2 -> S((e, a^)^S\{e, a 2 ). 
Thus there is a general recursive function ^(a) such that 
{a^a 2 -> 4 >(a x ) a 2 )} & {(Ex)R(a, x) ss (Ex)T^{a), <j/(a), x )}\in Post’s 
terminology (1944), the decision problem for the set a(Ex)R(a, x) is 
1-1 reducible to that for ^.(Ex^^a, a, x). That this is so for every set 
a(Ex)R(a, x) with general recursive R is a property of the present set 
a(£'%)r i (a, a, x) which Post showed is possessed by another recursively 
enumerable set “K” called by him “the complete set” (1944 p. 295 and 
Theorem p. 297 ). We obtain like results for the forms with more quanti¬ 
fiers and for l > 0 . For example, from ( 70 ) with n = 3 (l — 0 ), we infer 
that 

(Ex)(y)(Ez)R(a, x, y, z) = (Ex)(y){Ez)T 3 (S\(e, a), S\(e, a), x, y, z) 

(cf. * 71 , *72 § 33 ). Thus (Ex)(y)(Ez)T 2 (a, a, x, y, z) has a decision prob¬ 
lem of highest degree of unsolvability for predicates of the form 
(Ex)(y)(Ez)R(a, x, y, z) with R recursive, and the decision problem for 
any predicate of this form is 1-1 reducible to its decision problem. 

Example 3 . Omitting the z in Example 2, we obtain that the decision 
problem for a(Ex)R(a, x) is 1-1 reducible to that for a(Ex)T 0 (a, x) (and 
hence by Example 2, the predicates (ExjT^a, a, x) and ( Ex)T 0 (a, x) 
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have decision problems of the same degree of unsolvability); and sim¬ 
ilarly with more quantifiers, e.g. 

(■ Ex){y){Ez)R(a , x, y, z) = (. Ex)(y)(Ez)T 2 {S\{e, a), x, y, z), 
where e is a Godel number of [LzR(a, x, y, z). 

The major part of § 66 can be understood without the rest of this 
section. 

The A-notation (to be used in § 82 ). When 

Wl • ' * ym%\ • • • X n 9(y x , • • •> ym> x \t • ■ • j %n) 
is partial recursive and e defines it recursively, we shall use 

Ae)*l • • • X n 9 (yi> • • ■ > ymi x l> • • • I x n) 

as an abbreviation for S™(e, y v ..., y m ). Usually we omit the subscript 
“(e)”. Thus (for m = 0 ) “ A.x x ... x n <p(%, .... x n )” will stand for some 
Godel number of the function \x 1 ... x n <p{x v ..., x n ); and (for m > 0 ) 
“ \x 1 ... x n (f>(y 1 , ...,y m ,x lt ...,x n )” will stand for some primitive 
recursive function, whose value for each m-tuple y 1 , ..., y m of natural 
numbers as arguments is some Godel number of the function 
\x x ... x n cp (y v ...,y m ,x v ...,x n ). 

Using this abbreviation with the abbreviation 
“{z}(x x , . ..,x n )” for O n (z, x v . 
we have, for any n-tuple t v ... ,t n of natural numbers, 

{A* x ... «p (y v ...,y m , x v .. x n )}(t v ...,t n ) 
y(yv • • •» ym> • • • > ^n)- 

Like notation with “A w ‘. . may be used, when T are l completely 

defined functions and predicates of m x , .. .,nti variables, respectively. 

Lemma VI. Lemma I § 47 holds reading “partial recursive” in place 
of ‘‘primitive recursive”. Thus (for p = 1): If y{x x , ...,x n ) is partial 
recursive uniformly in functions 0, T, and y*{x x , ..., x n , c) is the function 
obtained when 0 in the definition of cp is taken to be a function 0* depending 
on a parameter c, then 9* is partial recursive uniformly in 0*, Y. 

Proof. By hypothesis there is a system E of equations such that, 
for each choice of 0 , Y, the system E defines the resulting function 9 
recursively from 0, Y. Let Y be ^1, ..., an d sa y tha.t the given function 
letters of E (expressing 0, Y, respectively) are t, g x , ..., g ; , and the 
principal function letter is f. Let c be a variable not occurring in E. Let 
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E* result from E by changing simultaneously each part h(r 1; ..., r s ) 
where h is a function letter and r x , ..r s are terms to h(r lf ..r„ c). 
Let E* be the system of equations 

* ' * * C) ' * ' » • • •, • • • t 3-Tnir c) == • • • i 

where g lt . ...g t are distinct function letters not occurring in E. Let 
E* be E*E*. We show that, for each choice of 0*, Y, the system E* 
defines cp* recursively from 0 *, Y. 

Consider any fixed choice of 0 *, Y and c. Let (E^*) c be the set of the 
equations t(s lf ..., s q , c)—u which 6Ef. Let (E^) c be the set of the 
respective equations t (s lt ..., s q )=u, i.e. the set of the equations 
expressing the values of the function 0 when we take 0(s x , ..s q ) = 
0 *(s x , .. s q , c) for the fixed c. 

We easily see that if (E?) c , eJJ”;Jj, E |- .*»)=x, then 

(E?*) c ,Ef;;;;f;, e* k f(x x , ...,x n ,c)=x, a fortiori E^;j;,E* h 
f(*i> ...,x n , c)=x. 

We now demonstrate the converse. Let (E'^'”'^) c be the set of 
the equations g<(y x , ..., y mi , c)=y where g t (y lf ..., y m .) =y 8 Ej[;"jj. 
Let (E*) c be the system of equations which results by substituting c 
for c throughout E*. We say that an equation e is a c-equation, if there 
is a deduction of e from E x E* and either e 8 (E^‘) c , or e 6 (Egj"'^) c , 

or the principal equation (§ 54 ) of that deduction of e 8 E* and the prin¬ 
cipal branch contains an application of R 1 which substitutes c for c. 
We can prove (by induction on the height of a given such deduction of e) 
that, if e is a c-equation, then (E^*) c , (e£J"'^) c , (E*) c h e. In this 
resulting deduction, every occurrence of a function symbol h is in a part 
of the formh(r x , .. .,r s , c). Any equation of the form f(x x , .. .,x n , c)=x 
deducible from E x E* is a c-equation. If in the resulting 

deduction just described we change simultaneously each part h(r 1( _, r„ c) 

to h(r x , ..., r s ), we obtain a deduction of f(x x , ..., x„)=x from 

(E?)«. E+£t|. E. 

Theorem XXIV. (a) If <p(x x , ...,%„) is partial recursive uniformly 
in partial recursive functions 0 X , ..., 0 r , then there is a partial recursive 
function <p (z v ..., z r , x v ..., x n ) such that, when t v ..., t r are any Gddel 
numbers of © x , ..., 0 r , respectively, cp(x x , . v , x n ) ~ <p(t v ...,t r , x v ...,x n ). 

(b) If is partial recursive uniformly in partial recursive functions 
0 X , ...,0 r , then there is a primitive recursive function yj (z v ...,z T ) such 
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that, when t v ...,t r are any Godel numbers of 0 1( . .., 0 r , respectively, 
7j (t v ..., t T ) is a Godel number of 9. 

Also both parts hold reading “partial recursive in 0 1( ..., 0 r , V F”, “partial 
recursive in 'F”, “Godel number from 'F in place of “partial recursive in 
0 X , . .., 0 r ”, “partial recursive’’, “Godel number”, respectively, where 'F 
are l completely defined functions. 


Proofs (for 1 = 0 ). (a) By use of Lemma VI, the result of replacing 
0,(s lf ..., Sqf) by 0 Qi (z o s v ...,s Qi ) (i = 1, ...,r) in the definition of 
<p{x v ..., x n ) from 0 lf ..., 0 r is a function y(z lt ..., z T , x v ..., x n ) partial 
recursive in <J> 8i , ..., G> 9r . But O ffi , ..., 0 9r are partial recursive, so 
9 (z v ..., z r , x 1 , ..., x n ) is partial recursive. Also 
©.(Sj, . ..,s ai ) ~ <J> 5< (* <( s lt ...,%), so <p(x v .. .,*„) ~ 9^, ...,t T ,x v .. 

(b) Let e be a Godel number of 9 (z v ...,z r ,x 1 , ..., x „), and take 
r i (z 1 , ...,z r ) = S r n {e, z v ...,z r ). 


Example 4 . By our proof of Theorem XIII § 60 (going back upon 
that of Theorem IV § 57 ), the / of Figure la § 61 can be any Godel number 
of the partial recursive function y.yR(x, y). As \xyR{x, y) is partial recursive 
uniformly in R, by Theorem XXIV (b) (applied to the representing 
function of R) there is a general recursive (actually, primitive recursive) 
function r t (z) such that, for any Godel number r of R(x, y), t](r) is an / for 
Figure la. This shows that the recursively enumerable set C 0 (namely, 
x(Ey)T x {x, x, y)) is what Post 1944 calls a creative set. (The Godel number 
r of R(x, y) takes the place here of Post’s “basis B” for the recursively 
enumerable set x(Ey)R(x, y).) 

Example 5 . Our examples of predicates not general recursive given 
in Theorem V § 57 were of n ;> 1 variables. — The predicate (Ex)R(x) 
of 0 variables is not general recursive uniformly in the predicate R. 
For suppose it were. Then by Theorem XXIV (a) for n = 0 (applied to 
the representing functions), there would be a partial recursive predicate 
Q(z) such that (Ex)R(x) s= Q(t) for any Godel number t of R(x). Now 
let e be a Godel number of the predicate T x {a, a, x). Then < 2 (S^(tf, a)) would 
be a general recursive predicate of a, and (ExjT^a, a, x) = Q(S\(e, a)). 
But [Ex^^a, a, x) is not general recursive (Theorem V ( 15 )). 

Indefinite description. Theorem XXV. For each w ;> 0, there is 
a partial recursive function \ n (z, x v .... x n ) with the following property. 
Suppose r is a Godel number of a partial recursive predicate R{x lt ..., x n , y). 
Then v n (r, x v . .., x n ) is defined if and only if (Ey)R(x l , ..., x n , y), 
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in which case its value is a number y such that R(x v ..., x„, y). We usually 
abbreviate v n (r, x v ...,x n ) as “vy ir) R(x lt ..., x n , y)” or even 
“vyR(x v ..., x n , y)” , and read it “a y (depending on r ) such that 
R(x v ...,x n , y)”. 

Similarly for l completely defined functions and predicates T*, reading 
“partial recursive in T”, "Godel number from V F”, “vyjr) ’ in 

place of "partial recursive", "Godel number", “v n ”, “vy (r) ”, respectively . 


Proof (for 1 = 0). We define 

v„( 2 , x v ...,x n )~ (i xy[T n+1 (z, x lt ...,x n , (y) 0 , (y) x ) & f7((y)!)=0]) 0 . 

Discussion. The least number operator \xy and the indefinite de¬ 
scription vy correspond to two different effective procedures for cal¬ 
culating a y such that R(x v ..., x n , y), each with respective limitations. 

Let R be effectively decidable. To calculate \xyR{x v ..., x n , y), we 
first try to settle by the algorithm for R whether R(x v ..x n ,0) is true 
or false. If it is true, we take 0 as the number sought; if false, we then try 
next to settle whether R(x v 1) is true or false; and so on. If 

before finding a y for which R{x v ..., x n ,y) is true, we come to one for 
which R(x v . y) is undefined, we are unable to get past this y, but 

must (in obedience to the rules of our calculation procedure) continue 
ad infinitum in the futile effort to settle whether R(x v ..., x n , y) is true 
or false for it. For a particular R, we might find another procedure which 
would get around this obstacle, but we cannot in general (cf. Example 6 
§64). 

To calculate vyR(x v ...,x n , y), we distribute our efforts to settle 
whether R(x v ..., x n , y) is true or false for various y’s, so that, while 
our efforts to settle whether R{x lt 0) is true or false are in progress, 

if after a certain number of steps they have not led to a decision, we then 
set to work also on R(x v ..., x n , 1), and so on. The search, if not ter¬ 
minated, will for each y eventually carry the algorithm for determining 
the truth or falsity of R(x v ..., x n , y) for that y arbitrarily far. As 

soon as R(x v _, x„, y) is found to be true for any y, we accept that y 

without attempting to settle whether it is the least. The y obtained by 
this procedure may vary with the algorithm for R (or the Godel number r) 
which is used. According to the following example (stated for n = 0), 
this is unavoidable in general. 


Example 6. Suppose w'(z) is a partial function with the two prop¬ 
erties : (a) If r is a Godel number of a partial recursive predicate R(y), 
then (Ey)R(y) -> R(v'(r)). (b) If r and s are Godel numbers of the same 
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predicate, then v'(r)~v'(s). We shall show that then v' is not partial 
recursive. For each k, let partial recursive predicates R k and S k be defined 
thus: 

R k {y) = y=0 V y=l+0*{jur 1 (A, k, z), S k {y) ^ y^-^zT^k, k, z) V y— 1. 

Let and s k be any Godel numbers of R k and S k , respectively. Given 
k, R k (0 ) and S fc (l) are true; so by (a), V(r k ) and v'(s*) are both defined. 
If (Ez)T 1 (k, k, z), then (y)[7G(y) ^ S fc (y)], i.e. R k and S k are the same 
predicate; and hence using (b), v'(r k )=v'(s k ). Thus 
(Ez)T 1 (k, k, z) -> v'(r fc )=v'(s Jt .), or contraposing (cf. *12 § 26), 
v'(r*) =£v'(s*) -> {Ez)T 1 (k, k, 2 ). Conversely, if (EzjT^k, k, z), then R k (y) 
is true only for y = 0 and S k (y) only for y = 1 , so by (a), v'(r fc ) ^v'(s fc ). 
Thus v'{r k )^'j'(s k ) = (Ez^^k, k, z) = ( z)T 1 (k , k, z). The expressions 
defining R k {y) and S k (y) are partial recursive predicates of the two 
variables k, y ; say those predicates have Godel numbers r and s, respec¬ 
tively. We can take r k = k) and s k = S}(s, k) in the above, obtaining 
the equivalence V(S\(r, k))^^'(S\(s, k)) = (z)T 1 (k, k, z). If v' were 
partial recursive, the left member would be a general recursive predicate 
of k; but the right is not (Theorem V (14)). 

§ 66. The recursion theorem. Theorem XXVI. For any n > 0, 

let F(£; x v .... *„) be a partial recursive functional, m which the function 
variable X, ranges over partial functions of n variables. Then the equation 

K(x v .. .,x n ) ~ F(£; x v ..., x n ) 

has a solution <p for X such that any solution y' for X, is an extension of y, 
and this solution y is partial recursive. 

Similarly, when Y are l partial functions and predicates, 

X(x v ...,*„) ~ F^.Y;*!, ...,x n ) 

has a solution y for X such that any solution y' for X is an extension of y, 
and this solution y is partial recursive in Y. (The first recursion theorem.) 

Proof (for / = 0, n — 1). Let <p 0 be the completely undefined 
function. Then introduce y v <p 2 , 93, ... successively by 

?i(*) — f (9o; x), y 2 {x) ~ F(cp x ; x), y 3 (x) ~ F(<p 2 ; x), ... 

Since 9 0 is completely undefined, 9 X is an extension of 9 0 ; then by Theorem 
XXI (a), y s is an extension of y v y 3 of y 2 , etc. Let 9 be the “limit function" 
of 9 0 , y v 9 a , ...; i.e. for each x, let y(x) be defined if and only if 9 s (a:) 
is defined for some s, in which case its value is the common value of 
9 s (x) for all s ;> the least such s. Now: 
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(i) For each x, y{x) F(cp; x). For consider any x. Suppose <p(x) is 
defined. Then for some s, cp(x) ~ <p s+1 (%) [by definition of 9] ~ F(9,; x) 
[by definition of 9, +1 ] ~ F(9; x) [by Theorem XXI (a), since 9 is an 
extension of 9J. Conversely, suppose F(9; x) is defined; call its value k. 
Since F is partial recursive, there is a system F of equations defining F(£; x) 
recursively from £, say with f as principal and g as given function letter; 
so now there is a deduction of f(x) = ft from Eg, F. Let g(y 1 ) = z 1 , ..., 
g(y p )=z P (where z ( — 9(y*)) be the equations of Eg occurring in this de¬ 
duction. But 9(y 1 ) = 9 Sl (>'i)- • • •• <p(y») = Vs p {y r ) for some s v ...,s p . 
Let s = max(s 1 , .. ., s p ). Then 9(y x ) = 9,(yi), ..9(y„) = 9s(yj>)- So 
g(yi)= z i> ■ • •> g(y P )= z J> e Eg 8 - Thus E| s . F h f(x)—k. Hence k~ 
F(9 s ;x) ~ (p s+1 {x) ~ 9(x). 

(ii) If for each x, <p'(x ) ~ F(9'; x), then 9' is an extension of 9. It will 
suffice to show by induction on s that, for each x, if y s {x) is defined, then 
<p'(x) ~ <p a (*). Basis: s = 0. True vacuously. Ind. step. Suppose for a 
given x that 9 s+1 (x) is defined. Then 9 <+1 (x) ~ F(9 S ; x) ~ F(9'; x) [by 
Theorem XXI (a), since by hyp. ind. 9' is an extension of 9J ~ 9 '(a). 

(iii) If F defines F x) recursively from £, and E comes from F by 

substituting the principal function symbol f for the given function symbol g, 
then E defines 9 recursively. It will suffice to show that E f- f(x)=ft, 
if and only if y s (x) ~ k for some s. We easily see that if 9 s (x) = k, then 
E [- f(x)=fe. For the converse, we show by induction on h that if there 
is a deduction of i(x)—k from E of height h, then y s {x) = k for some s. 
The deduction can be altered if necessary, so that in each inference by 
R 2 with a minor premise of the form i(y)=z only one occurrence of f(y) 
in the major premise is replaced by z (Act 1 ). The occurrences of f in 
equations of the deduction can be classified in an evident manner into 
those which come from an occurrence of f in F, and those which come via 
the substitution of f for g from an occurrence of g in F. Now consider the 
inferences by R 2 with minor premise of the form f(y)=z in which the 
f of the part f(_y) replaced comes from a g in F. Say there are p such infer¬ 
ences, the minor premises i{y-f)—z v .... f(y p )=z p of which do not stand 
above other such premises. Each of these p premises occurred above 
the endequation of the given deduction before Act 1; so using the hy¬ 
pothesis of the induction, z 1 ~ 9s 1 (y 1 ) ~ 9 s (y x ), 9 Sp (y v ) ~ 9 s (y„) 

where s = max(s 1 , ..., s P ). Now consider the tree remaining from the 
deduction after Act 1 , when all the equations above f (y^ =z v .... f(y v )—z P 
are removed (Act 2 ). In this tree, let each occurrence of f which (before 
Act 2) came from a g of F be changed back to g (Act 3). The f’s in question 
all occurred in the right members of equations, since g being the given 
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function symbol of F occurs in F only on the right; so no f is changed by 
Act 3 in what was a minor premise for R 2 before Act 3 or in the end- 
equation f(x)=ft. Finally, let the f’s of i(y 1 )—z 1 ,...,i(y p )—z p be 
changed to g (Act 4), which restores the inferences by R 2 which Act 3 
spoiled. The resulting tree is a deduction of i(x)—k from Eg*, F. Hence 

k ~ F(<p,;*) ~ <p, +1 (*). 

Example 1 . Consider the problem: to find a partial recursive function 
cp such that 

(a) ?(*) sa <f>(x) ; 

i.e. to solve the equation £(. x) ~ £(x) for C Obviously any partial function 
satisfies this equation. The partial function with the least range of 
definition which satisfies is the completely undefined function. This is 
the solution <p given by the theorem (with F(£; x) ~ £(#)). 


Example 2. To find a partial recursive function <p such that 

(a) ?(*) <p(*)+l; 

i.e. to solve Z,(x) ~ £(#) +1 for Only the completely undefined partial 
function satisfies. This of course is the solution 9 given by the theorem 
(with F(£;*)~ £(*) + !). 

Example 3. To find a function 9 partial recursive in x such that 
/ a \ f 9(0) ^ q, 

l 9(/) =* x(y. 9(y)) 

(Schema (Va) § 43). Only one function 9 satisfies for a given x, and we 
already know by Theorem XVII (a) that it is partial recursive in x- 
However to see how the theorem applies, we rewrite (a) as 

(b) 9 (,)~F( T , x: ,) where FR. *;*) = { „ , >0 

(equivalently, 

F(£, x; x) ~ & w—q} V {#>0 & w=x(x— 1, £(*— 1))}]). 

Since F(£, x; *) is partial recursive (using Theorems XVII, XX (c); or 
XVII, XVIII, XX (a)), by the theorem 9 is partial recursive in x- 

Example. 4. We give a new proof of Theorem XVIII (which proof in 
various guises appeared in Kleene 1935, 1936, 1943). Let 
9 ( 2 :) ~ jj/y[x(*, y)=0]. Then y(x) ~ <p(x, 0 ) where 

(a) 9 ( 2 , y)~ \rt t > y [i(x, 0=03. 
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But <p(x, y) is the partial function 9 with the least range of definition such 
that 


(b) 



y if x( x > y) = °> 

?(*, y') if x ( x . y) * 0. 


By Theorem XX (c) (with the first proof), the right side of (b) is of the 
form F(<p, x; x, y) where F(£, x; x, y) is partial recursive; so by the present 
theorem <p(x, y), and hence y(x), is partial recursive in x- 


Discussion. The theorem for / = 0 asserts that we can impose any 
relationship of the form 

(72) <p(*i. • ••>*») ~ F (9 ; x v x n ) 

expressing the ambiguous value y(x v ...,«„) of a function 9 in terms 
of 9 itself and x v .... x n by methods already treated in the theory of 
partial recursive functions; and conclude that the partial function with 
the least range of definition which satisfies the relationship is partial 
recursive. 

Moreover the case of the theorem for / > 0, in which 

(73) f( x v F( 9 , Y ; x v ...,x n ) 

is the relationship imposed, can be used to extend the body of the methods 
available for use in further applications. 

In our examples of special kinds of “recursion” (§§ 43, 46 and beginning 
§ 55) the ambiguous function value <p(x v ..., x n ) was expressed in terms 
of values of tne same function for sets of arguments preceding the given 
»-tuple x lt ..., x n in terms of some special ordering of the w-tuples. We 
now have a general kind of “recursion”, in which the value y{x x , ..., x n ) 
can be expressed as depending on other values of the same function in 
a quite arbitrary manner, provided only that the rule of dependence is 
describable by previously treated effective methods. 

The given “recursion” may now be ambiguous as a definition of an 
ordinary (i.e. completely defined) number-theoretic function 9, in the 
sense that it is satisfied by more than one such function (Example 1, or (b) 
in Example 4 when x(*, y) does not vanish for infinitely many values of y). 
But now we choose as the solution which interests us that partial function 
which is defined only when the recursion requires it to be. The given 
“recursion” may be inconsistent as a definition of an ordinary function 
(Example 2); again the difficulty is escaped now through the fact that it 
is only a partial function which we are seeking as the solution. Both 
these situations can arise when the F is general recursive (Examples 1 
and 2). When F is incompletely defined, the recursion may also directly 
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demand under our usual convention (§ 63 ) that 9 be undefined for some 
arguments (e.g. Example 3 when x is given to be the completely unde¬ 
fined function), as well as indirectly through an inconsistency (not 
necessarily as obvious as in Example 2 ). 

Given a particular relationship of the form ( 72 ) or ( 73 ), it may be a 
difficult problem to recognize for what arguments x v ..,, x n the function 
value «p(Aj[, ..., x n ) must be defined. This problem is separate from the 
problem, which the first recursion theorem solves, of recognizing the 
partial recursiveness of the solution 9 having the least range of definition. 

We can use the theorem in presenting the case under (A2) § 62 . Our 
methods for showing given effectively calculable functions to be partial 
recursive (and hence when completely defined, general recursive) are 
now developed to the point where they seem adequate for handling any 
effective definition of a function which might be proposed. To describe 
in ordinary language a process for calculating a new function 9, we would 
have to explain effectively how any function value <?{x v . . ., x n ) is to 
be obtained from values of 9 already calculated. In this explanation, 
we would normally employ effectively calculable functions previously 
studied, the connectives of the propositional calculus, also possibly 
bounded quantifiers (unbounded quantifiers would not be effective), and 
descriptions of the form ‘the least number such that’. This vocabulary 
translates into operations already treated in our theory (cf. particularly 
Theorems XVIII and XX). The explanation as a whole would then 
come under the first recursion theorem; namely we could express it 
as a statement that 9 is to satisfy ( 72 ) for a certain F, and is to be defined 
only when the explanation leads to a value, i.e. 9 is to be the function of 
least range of definition satisfying ( 72 ). Then by the theorem we can 
conclude that 9 is partial recursive. 

Some other operations besides those just noted can be handled by 
methods already considered, e.g. the definition of several functions 
simultaneously (cf. Example 4 § 46 ). Still other notions we might employ 
we would expect to be able to explain first in terms of the above vocabu¬ 
lary. Our results are formulated so as to constitute, not only methods for 
showing particular functions to be partial (or general) recursive, but 
also tools for enlarging our stock of methods as the need may appear. 

Theorem XXVII. For each n > 0 : Given any partial recursive func¬ 
tion <J )(z,x lt .. .,x n ), a number e can be found which defines ^{e,x v . . .,x n ) 
recursively, i.e. such that 

( 74 a) {e}{x lt ..., x n ) ~ ']>{e, *,). 
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Similarly, for l completely defined functions and predicates Y, reading 
“partial recursive in Y”, “defines recursively from Y”, “{ }' F '” in place of 
“partial recursive”, “defines recursively”, “{ }”, respectively. (The re¬ 
cursion theorem, Kleene 1938.) 

Proof (for l = 0). The function <Jj(S*(y, y), x v ..., x n ) is partial 
recursive (cf. Theorem XXIII). Let / define it recursively, and take 
e = S^f, /). Then e defines recursively the function of n variables obtained 
by substituting the number / for the variable y in <j;(S* (y, y), x v ..., x n ); 
i.e. e defines recursively <p(S*(f, /), x v ..., x n ); i.e. e defines recursively 
<^{e, x v ..., x n ), as was to be shown. 

Discussion. The theorem can be read as saying that for any partial 
recursive function ^ the equation 

*(*1. ~ ty(z, x v ...,x n ) 

can be solved for z. For the notation, cf. § 65 especially (66). If we write 
(p{x v ..., x n ) for the function e(x lt ..., x n ) defined recursively by the 
solution e of this equation, (74a) can be written 

( 74 b) <p(*i- • • •. x n ) m x v .... x n ). 

But (]>(z, x v ..., x n ) can be any partial recursive function of n- j-1 variables. 
Thus the theorem says that a partial recursive function 9 can be found 
whose ambiguous value y(x 1 , ...,x„) is given from a Godel number 
e of itself and the numbers x v ..., x n by any preassigned partial recursive 
function <J>. Then 9 can be used in constructing its own ambiguous value, 
since in building <p(e, x v ...,*„) we can use e in parts of the form 
O n (e, u v ..., u n ) (briefly e(u v ..., «„)), and <D „(e,%, ...,«„) is 
9 (u v ..., u n ). This can be emphasized by stating a corollary. (The version 
of the corollary for / > 0 is left to the reader.) 

Corollary. Given a partial recursive functional F(£; z, x 1 ,..., x n ), 
where £ ranges over partial recursive functions of n variables, there can be 
found a partial recursive function 9 and a Godel number e of 9 such that 

( 75 ) 9 ( x v • • x n ) ~ F(9; e, x v . ..,x n ). 

Given a partial recursive functional F(£, 0 lt ..., 0 r ; z, w v ..., w„ x v ..., x „), 
where £ ranges over partial recursive functions of n variables, and 0 X , ..0 r 
over partial recursive functions of specified numbers of variables, there can 
be found a partial recursive function (f>{w v ..., w T , x v .... x n ) and a Godel 
number e of 9 such that, when t v ..,, t r are any Godel numbers of 0 1( .,., 0 r , 
respectively, 

( 76 ) ^(^i) • • • > t T , x x , ..., x n ) F(9, 0 x , ..., 0 r , e, t x , •.., t r , x x , ..., x„). 
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Proof (for r = 0). By Theorem XXIV (a), we can find a partial 
recursive function t}> of n+2 variables such that 

F(£; z,x v .... x n ) ~ 2, x lt ...,x n ) 

when s is any Godel number of £ Now let £(z, x v ..., x n ) ~ <J/(z, z, 
x 1 , ..x n ). Then by the theorem we find a number e for ( 74 ). Writing 
<p(x 1 , ..x n ) for the function defined recursively by e, we have 
?(%, • • -,x n ) ~ £{e, x v .. .,x n ) ~ £{e, e, x v . ..,x n ) ~ F(<p; e, x v . ..,x„). 

Discussion (concluded). When 9 is partial recursive uniformly in £ 
we may consider that <p depends partial recursively on £ as a function; 
when 9(% 1 , . . ., x n ) is a partial recursive function of x x , .. x n and a 
Godel number of C, that 9 depends partial recursively on £ as an object. 
The gain in Theorem XXVII over XXVI is that now in imposing a 
relationship expressing the ambiguous value <?{x x , ..x n ) partial 
recursively in terms of 9 itself and x lt ..x n , we can let 9 enter not 
only as a function but also as an object. Our conclusion then is that some 
partial recursive function 9 satisfies the relationship; but in general 
different solutions 9 will be obtained from different selections of the 
Godel number / of <p(S*(y, y), x v ..x n ) in the proof of Theorem XXVII. 

Theorem XXVII does not quite include XXVI as the case 9 enters 
into the “recursion” only as a function (so that the F(£; z, x v .... x n ) of 
the corollary reduces to F(£; x v ..x n )), because the method of proof of 
Theorem XXVII does not (at least without some further argument) 
show that the partial recursive solution 9 obtained is necessarily the one 
having the least range of definition. So Theorem XXVII via its corollary 
can take the place of Theorem XXVI in supporting Theses I, I*, D (b) 
and I*t (b), but not It (a) and I*t (a). 

Of course it is only after we have Church’s thesis that we have the 
means of regarding every effectively calculable function as a number- 
theoretic object. That then an effectively calculable function can be 
found to satisfy any “recursion” in which it enters as an object besides 
as a function is a consequence of the thesis which proves useful in further 
developments. 

Example 2 (concluded). To treat this by Theorem XXVII instead 
of XXVI, take the <±{z, x) for Theorem XXVII ~ 0^2, *)+l. 

E "ample '3 (concluded). Similarly taking <\i(z, x) ~ 

F(Xa4> 1 (2, x),y,x) ~ \xw[{x=Q 8 c w=q} V {tf> 0 & w—x(x—\,<f> x (z, x— 1 ))}]. 

Example 5 . Given fixed partial recursive functions 0 1( .... 0 5 , 
to find a partial recursive function 9 with the following properties. If 
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QiW = 0 , 9 (x) = 1. If ©!(*) = 1, <?(x) ~ e,(q>(e,(*))). if m*) = 2, 

9 (v) ~ 0 4 (« 9 x ) where a is some number which defines Xy <p(0 5 (.v, y)) 
recursively. Note first that if e is any Godel number of 9, then 
<p( 6 s (*, y)) ~ 0 5 (x, y)). Now XzryO^z, 0 5 (x, y)) is a fixed partial 

recursive function. If g be a Godel number of it, then S'\(g, e, x) is 
a Godel number of Xy0 1 (c, 0 5 (x, y )), i.e. of Xy<p(0 5 (a:, y)). Now the above 
three properties are imposed on 9 by the equation 

9(at) ~ M-M'[{0 1 W = 0 & w— 1} 

V {0i(^) = 1 & v— 6 a (?( 6 s (*)))} V {Q l (x)=2 & w=0 4 (Sf(g, e, *))}]. 

This is of the form (75) for the corollary, since (using Theorems XVII, 
XVIII, XX, XXIII) the right side is of the form F( 9 ; e, x) where F(^; 2 , x) 
is a fixed partial recursive scheme function, and e may be any Godel 
number of 9 . For the theorem 

^(z, x) ~ |AW’’[{6 1 (*)=0 & w— 1} 

V {0!(*) = 1 & ie'=0 2 (<h 1 (z, 0 3 (x)))} V {6 1 (.v)=2 & «’=0 4 (S|(g, z, .v))}]. 

The problem arose as above with particular functions Oj, ..., 0 5 inKleene 
1938 . We can generalize it by considering 0 1 , ...,0 5 as unspecified 
partial recursive functions. Let t v ..., t s be any Godel numbers of them. 
If h is a Godel number of Xzte.vy Oj,(z, <J> 2 (t£', x, y)), then SJ(A, e, t 5 , x) is a 
Godel number of Xy<I) 1 (e, 4> 2 (/ 5 , at, y)), i.e. of Xy 9 ( 0 5 (.r, y)). Now the 
properties are imposed on 9 by 

<p(x) ~ 9 (i' 1 , ..., t b , x) ~ pzc[{ 0 1 (x )=0 & w= 1 } 

V {0 x (x)= 1 & tc’=0 2 (ep(d 3 (x)))} V {0 1 (x)=2 & w=0 4 (Sf(/i, e, t b , *))}]. 

By the second part of the corollary, we can find a partial recursive 
9 (ie ; 1 , ..., w 6 , x) and a Godel number e of 9 to satisfy (b). 
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COMPUTABLE FUNCTIONS 

§ 67. Turing machines. Suppose that a person is to compute 
the value of a function for a given set of arguments by following pre¬ 
assigned effective instructions. In performing the computation he will 
use a finite number of distinct symbols or tokens of some sort. He can 
have only a finite number of occurrences of symbols under observation at 
one time. He can also remember others previously observed, but again 
only a finite number. The preassigned instructions must also be finite. 
Applying the instructions to the finite number of observed and remem¬ 
bered symbols or tokens, he can perform an act that changes the situation 
in a finite way, e.g. he adds or erases some occurrences of symbols, shifts 
his observation to others, registers in his memory those just observed. 
A succession of such acts must lead him from a symbolic expression 
representing the arguments to another symbolic expression representing 
the function value. 

We now inquire whether it is not possible to analyze the acts the 
computer can perform into certain “atomic” acts, such that any per- 
formable act will be equivalent to some succession of atomic acts. The 
atomic acts would be combinations of the following: recognizing a single 
observed occurrence of a given symbol, erasing this occurrence, marking 
down a single occurrence of a symbol, displacing the point of observation 
of a given array of symbols to an adjacent point, and altering the re¬ 
membered information. 

Such an analysis was undertaken by Turing ( 1936 - 7 , received for publi¬ 
cation 28 May 1936) in the form of a definition of a kind of computing 
machine. A similar analysis was proposed briefly by Post ( 1936 , received 
for publication 7 October 1936). 

The' evidence that the analysis is complete, i.e. that to any function 
which is effectively calculable a Turing machine can be found which 
computes it, will be reviewed after we have become familiar with the 
Turing machines (§ 70). 

We begin by formulating the idea of a Turing machine, as follows. 
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The machine is supplied with a linear tape, (potentially) infinite in both 
directions (say to the left and right). The tape is divided into squares. 
Each square is capable of being blank, or of having printed, upon it any 
one of a finite list s v ..., s } (j > \) of symbols, fixed for a particular 
machine. If we write “s 0 ” to stand for “blank”, a given square can thus 
have any one of ;'+1 conditions s 0 , ..., s 3 -. The tape will be so employed 
that in any “situation” only a finite number (> 0 ) of squares will be 
printed. 

The tape will pass through the machine so that in a given "situation” 
the machine scans just one square (the scanned square). The symbol on 
this square, or s 0 if it is blank, we call the scanned symbol (even though 
s 0 is not properly a symbol). 

The machine is capable of being in any one of a finite list q 0 , ..., q k 
{k ;> 1) of {machine) states (called by Turing "machine configurations” or 
“m-configurations”). We call q 0 the passive (or terminal) state ; and 
q a , ..., q*. we call active states. The list q 0 , ..., q fc is fixed for a par¬ 
ticular machine. 

A ( tape vs. machine) situation (called by Turing “complete configu¬ 
ration”) consists in a particular printing on the tape (i.e. which squares 
are printed, and each with which of the j symbols), a particular position 
of the tape in the machine (i.e. which square is scanned), and a particular 
state (i.e. which of the &+1 states the machine is in). If the state is active, 
we call the situation active', otherwise, passive. 

Given an active situation, the machine performs an {atomic) act (called 
a "move” by Turing). The act performed is determined by the scanned 
symbol s Q and the machine state q c in the given situation. This pair 
(s B , q c ) we call the configuration. (It is active in the present case that q c 
is active; otherwise passive.) The act alters the three parts of the situation 
to produce a resulting situation, thus. First, the scanned symbol s a is 
changed to s 6 . (But a = b is permitted, in which case the “change” is 
identical.) Second, the tape is shifted in the machine (or the machine 
shifts along the tape) so that the square scanned in the resulting situation 
is either one square to the left of, or the same square as, or one square 
to the right of, the square scanned in the given situation. Third, the 
machine state q c is changed to q d . (But c — d is permitted.) 

No act is performed, if the given situation is passive. 

The machine is used in the following way. We choose some active 
situation in which to start the machine. We call this the initial situation 
or input. Our notation will be chosen so that the state in this situation 
(the initial state) is q x . The machine then performs an atomic act. If the 
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situation resulting from this act is active, the machine acts again. The 
machine continues in this manner, clicking off successive acts, as long 
and only as long as active situations result. If eventually a passive 
situation is reached, the machine is said then to stop. The situation in 
which it stops we call the terminal situation or output. 

The change from the initial situation to the terminal situation (when 
there is one) may be called the operation performed by the machine. 

To describe an atomic act, we use an expression of one of the three 
following forms: 

Sb^Qd- 

The “L”, “C”, “R” indicate that the resulting scanned square is to the 
left of, the same as (“center”), or to the right of, respectively, the given 
scanned square. 

The first part of the act (i.e. the change of s 0 to s b ) falls into four cases: 
when a — 0 and b> 0 , it is “prints s 6 ”; when a> 0 and b = 0 , “erases s 0 ”; 
when a, b> 0 and a ^b, “erases s 0 and prints s b ” or briefly “overprints 
s 6 ”; when a = b, “no change”. We often describe this part of the act as 
“prints s b ” without regard to the case. 

To define a particular machine, we must list the symbols s v 
and the active states q 1( ..., q s , and for each active configuration (s a , q c ) 
we must specify the atomic act to be performed. These specifications 
may be given by displaying the descriptions of the required acts in the 
form of a ( machine) table with k rows for the active states and j +1 
columns for the square conditions. 

Example 1 . The following table defines a machine (“Machine 2 f”) 
having only one symbol s x and only one active state q x . 

Name of Machine Scanned symbol 

machine state s 0 s x 

21 q x s x Cq 0 

Suppose the symbol s x is actually a tally mark “I”. Let us see what 
the machine does, if a tape of the following appearance is placed initially 
in the machine so that the square which we identify by writing the 
machine state q x over it is the scanned square. The conditions of all 
squares not shown will be immaterial, and will not be changed during the 
action. 
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The machine is in state q 1( and is scanning a square on which the symbol 
s x is printed. In this configuration, the atomic act ordered by the table 
is Sjftq^ i.e. no change is made in the condition of the scanned square, 
the machine shifts right, and again assumes state q t . The resulting 
situation appears as follows. 

Qi 


The next three acts lead successively to the following situations, in the 
last of which the machine stops. 


Ti 




Machine 3t performs the following operation: It seeks the first blank 
square at or to the right of the scanned square, prints a 1 there, and 
stops scanning that square. 

Now we define how a machine shall ‘compute’ a partial number- 
theoretic function 9 of n variables (cf. § 63). The definition for an or¬ 
dinary (i.e. completely defined) number-theoretic function is obtained 
by omitting the reference to the possibility that 9 ^, .. x n ) may be 
undefined. 

We begin by agreeing to represent the natural numbers 0 , 1 , 2 , ... by 
the sequences of tallies I, II, III, ..., respectively, the tally " 1 ” being 
the symbol There are y+1 tallies in the representation of the natural 
number y. 

Then to represent an m-tuple y v ..., y m (m ;> 1 ) of natural numbers 
on the tape, we print the corresponding numbers of tallies, leaving a 
single blank between each two groups of tallies and before the first and 
after the last. 
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Example 2. The triple 3, 0, 2 is represented thus: 


III 


We say that (the representation of) a number y (or of an w-tuple 
y v ..., y m ) on the tape is ( scanned) in standard position, when the 
scanned square is the one bearing the last tally in the representation of 
y (or of y m ). 

Now we say that a given machine 9JI computes a given partial function 
9 of n variables (n > 1 ), if the following holds for each w-tuple x v ...,%„ 
of natural numbers. (For the case n = 0, cf. Remark 1 below.) Let 
x v ..., x n be represented on the tape, with the tape blank elsewhere, i.e. 
outside of the x x -\- ... -}-x n -\-2n-\-\ squares required for the repre¬ 
sentation. Let 9)1 be started scanning the representation of x v ..., x n 
in standard position. Then 90? will eventually stop with the ti +1 -tuple 
x v ..., x n , x represented on the tape and scanned in standard position, 
if and only if <p{x lt is defined and <p(x x , ..., x n ) = x. (If <p(x v ..., x n ) 

is undefined, 9)1 may fail to stop. It may stop but without an n+1-tuple 
x v ..., x n , x scanned in standard position.) 


Example 2 (concluded). If 9 ( 3 , 0, 2 ) = 1 and 90? computes 9 , then 
when 9 )? is started in the situation 


qi 



with all squares other than those shown blank, it must eventually stop 
in the situation 

q 0 

1 1 1 1 1 111 |i 1 


where the condition of the squares other than those shown is immaterial. 

Although only one symbol s x or “I” is used in stating the arguments 
and in receiving the function value, others may be used in the progress 
of the computation. For each n > 1 , each machine (with its first symbol 
s x serving as the tally) computes a certain partial function of n variables. 

A partial function 9 is computable, if there is a machine 9D? which 
computes it. 
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We have not attempted here to reproduce the detailed formulation of 
Turing 1936-7, but only his general conception of the behavior of the 
machines. Although he noted a variety of applications of his machines, 
he confined his detailed development to machines for computing dual 
expansions of real numbers x (0 < x < 1). The successive digits were to be 
printed ad infinitum on alternate squares of a 1-way infinite tape, while 
the intervening squares were reserved for temporary notes serving as 
scratch work in the continuing computation. It should be an easy 
exercise, after §§ 68 and 69 , to show that a machine exists which does 
this, if and only if the n-th digit in the dual expansion is a computable 
function of n. A critique, of help to one who would study Turing’s 
paper in detail, is given in the appendix to Post 1947. Our treatment 
here is closer in some respects to Post 1936. Post 1936 considered 
computation with a 2-way infinite tape and only 1 symbol. 

Our main objective next is to prove the equivalence of computability 
with partial recursiveness, or, when only completely defined func¬ 
tions are considered, with general recursiveness (§§ 68, 69 ). 

It is of some interest to see at the same time whether a 1 -way infinite 
tape and 1 symbol will not suffice. We say that a machine 501 1/1 computes 
9 , if it computes 9 subject to the following restrictions, and we say that 
9 is 1/1 computable , if it is 1/1 computed by some machine 501: (i) The 
machine 50? has only one symbol s x . Moreover, when 50? has been started 
as described for any w-tuple x v ..., x n , it behaves so that the following 
{(ii) — (iv)) are the case, (ii) A square which in the initial situation is to 
the left of the representation of x v ..x n (i.e. to the left of the blank 
preceding the first tally of x t ) will not be scanned in any subsequent situ¬ 
ation. Hence the computation can equally well be performed on a 1 -way 
infinite tape (infinite to the right), (iii) If y{x lt ..x n ) is defined, then 
in the terminal situation the representation of x v ..x n , <p(x lt ..x n ) 
starts on the same square as did that of x v ..., x n in the initial situation, 
and all squares to the right of the representation of x v ..x n , <p(x v ..., x n ) 
are blank (also by (ii), those to the left), (iv) The machine 50? eventually 
stops, only if <f>(x v ...,*„) is defined. 

It will follow from the results of §§ 68 and 69 that 1/1 computability 
is equivalent to computability; and hence that all wjs computability 
notions are equivalent, where w is 1 or 2 according as (ii) — (iv) are re¬ 
quired or not, and 5 is an upper bound for the numbers of symbols different 
machines may have (00, if there is no finite upper bound). (In this scale, 
our original notion of computability is 2 /oo computability.) 

Other variants of ‘computability’ can be formulated, e.g. we might 
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omit (iii) and (iv) while retaining (ii), or instead of (ii) we might assume 
that the machine is supplied with only a 1 -way infinite tape, and require 
it to stop if a situation is reached from which the machine table orders 
a motion leftward from the leftmost square. Instead of representing the 
numbers on the tape by tallies, we might use dual notation or decimal 
notation. It should not be hard to show, after §§ 68 and 69, that each of 
these other computability notions is equivalent to ours (cf. § 70). 

Now we extend our notions to the case of computation from l com¬ 
pletely defined functions <jq, ..., <J/j (briefly Y) of m v ..., m l variables, 
respectively. Here the idea is modified by assuming that any value of 
one of the functions Y, if demanded in the course of the computation, 
will thereupon be supplied (cf. end § 61). 

A machine for this purpose may have among its active states q 1( ,.., q* 
ones from which a new kind of act (not atomic in character) is performed. 
Let q c be one of these states. When the state is q c , and (for a certain i 
depending on c) an m r tuple y v ..., y mi is scanned in standard position, 
the act performed by the machine shall consist in supplying next to the 
right of this m r tuple pi{y lt . .., y TO< ) +1 tallies and a blank, at the same 
time displacing by i}q(yi, • • •. y m ,-)+2 squares to the right all printing 
which previously existed to the right of the scanned square, and then 
assuming state q d (depending on c) with the resulting 1-tuple 

y v ..., y mi , ^ i(y v .... y mi ) scanned in standard position. When the state 
is q„ but an w r tuple is not scanned in standard position, we shall consider 
that the machine performs an identical act (with resulting situation the 
same as the original situation). In the machine table, corresponding 
to each such state q c , we specify only the pair iq d , as in other respects 
the act is determined from the situation by the definitions just given 
and the functions Y. 

Modifying our machine notion in this way, we obtain the three notions : 
machine from Y, computes 9 from Y, 9 is computable from Y. If (for 
fixed n, l, m v ..mf) 9 is computable from Y by a machine 9K, the table 
for which is independent of Y, we say that 9 is computable uniformly from 
Y, or that the functional 9 ~ F(Y) is computable. (The theorems of 
§§ 68 and 69 establish uniformity, whenever uniformity is assumed.) 

In defining Wl 1 /1 computes 9 from Y, and 9 is 1/1 computable from Y, 
we add the following to our former list (i) — (iv) of restrictions: (v) For 
each state q c for which the table entry is of the form iq d , the state q c 
is reached only when some m r tuple y lt ..., y m> is scanned in standard 
position and all squares to the right of the scanned representation of 
y%, • • •. y m , are blank. 
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We can easily formulate the definition of ‘computability’ from assumed 
functions, say from ^ (with l — m 1 = 1 ), in an equivalent way, in which 
the machine performs only acts atomic in character, but is supplied with 
a tape having a (patentially) infinite printing representing the sequence 
of the values of This printing may be on a second tape, or on alternate 
squares of the one tape. 

Remark 1 . In this chapter, outside the present remark and passages 
referring to it, we shall understand that we are dealing with functions of 
n > 1 variables. Since we have not provided for representing w-tuples of 
natural numbers on the tape for n — 0 , we say a machine computes a 
function 9 of 0 variables, if it computes the function cp(v) of 1 variable such 
that (f(x) ~ 9. For an assumed function of 0 variables, the non-identical 
act from a state q c with table entry i(\ d is to be performed when any 
1 -tuple x is represented on the tape. With these definitions, it will follow, 
as soon as we have proved Theorems XXVIII—XXX for functions of 
> 0 variables, that they also hold for functions of 0 variables. 

§ 68 . Computability of recursive functions. Theorem XXVIII. 
Every partial recursive function 9 is 1/1 computable. Every function cp 
partial recursive in l completely defined functions V F is 1/1 computable from T\ 

The proof, given at the end of this section, will be based on Corollary 
Theorem XIX § 63 (for 1 = 0 and a general recursive 9 , on Corollary 
Theorem IX § 58). An intuitive calculation by Schemata (I) — (VI) is 
accomplished by repetitions of a few simple operations, such as copying 
a number previously written (at a determinate earlier position), adding or 
subtracting one, deciding whether a given number is 0 or not 0 . We 
shall first construct some machines to perform such operations as these. 

We begin by introducing some notations which will be convenient 
here, including modifications of notations used in § 67. 

A given square of the tape must now have one of two conditions s 0 
and s x , or blank and printed with a tally “I”, or white and black. In the 
illustrations in this section, it will be convenient to show these two con¬ 
ditions as “ 0 ” and “ 1 ”, respectively. 

To identify the scanned symbol in our illustrations, we now usually 
write simply a bar over it, instead of the machine state as in § 67. 
Sometimes a small numeral " 1 ” or " 2 ” will be written with the bar to 
indicate a certain machine state (the same in all occurrences of that 
numeral), to which we wish to call attention without indicating at the 
moment which of q 0 , ..., q fc it is. 
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In describing an atomic act, we now write “P” ("prints”) for changing 
0 to 1 (i.e. s 0 to SjJ, “E” ("erases”) for changing 1 to 0 (i.e. s a to s 0 ), and 
nothing for the change s a to s 6 when a — b. We omit the “C” which we 
used in § 67 to signify no change in the scanned square. We now write 
simply "0”, "1”,..., "k” for the machine states q 0 , ...,q*. For 
example, the act written "SiCq,,” in § 67 is now written as "P0” when s 0 
was the scanned symbol initially, and as "0” when s x was the scanned 
symbol initially. 

As a further example, we repeat the table for Machine 21 (Example I 
§ 67), and the former illustration of the operation it performs (showing 
now only the initial and final situations). 

Name of Machine Scanned symbol 

machine state 0 1 

21 1 P0 PI 

Starting from the initial situation 

T 1 1 0, 

Machine 21 reaches the terminal situation 

1 1 1 T. 

Machine 21 prints on the first blank square at or to the right of the 
scanned square, and stops there. 

In our illustrations prior to "Proof of Theorem XXVIII”, none of the 
squares to the left or right of those shown is scanned in any intermediate 
situation during the machine action. Thus their condition is immaterial. 

For each machine constructed prior to "Proof of Theorem XXVIII”, 
we shall state what is the leftmost square scanned in any intermediate 
situation, in each case when it falls outside the interval between the 
initially and terminally scanned squares inclusive. This information 
will be used in concluding that the machines we build in the proof of the 
theorem satisfy the restriction (ii) § 67. 

In this section we shall generally think of a sequence of y-f-1 con¬ 
secutive printed squares preceded and followed by a blank square as 
representing the natural number y. The same blank square may be 
regarded simultaneously as the last square in the representation of one 
number and the first in the representation of another. This enables us 
to think of< any printing on the (2-way infinite) tape as consisting of 
a finite succession of representations of natural numbers. More than one 
consecutive blank squares (say z -fl of them) between two groups of 
consecutive printed squares we call a gap (of z squares) between the 
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numbers represented. The blank portion of the tape before the first, or 
after the last, printed square we also call a gap (of co squares). As in 
§ 67 , we do not think of m successive representations of natural numbers on 
the tape as representing an w-tuple, unless there are no gaps between them. 

Similar remarks apply to a 1-way infinite tape, provided the first 
(leftmost) square is blank. There may or may not be a (finite) gap before 
the first number. 


Now consider the problem of constructing a machine 93 to perform the 
following operation: Given a number on the tape, not the leftmost, to 
move this number to the left so as to close the gap (if any) between it 
and the preceding number. The given number is to be scanned in standard 
position before and after the operation. The leftmost square scanned in 
any situation during the action shall be the rightmost printed square 
of the preceding number. 

We begin with an illustration, showing a typical initial situation 
(Situation 1), some proposed intermediate situations not necessarily 
consecutive (Situations 2 — 16 ), and the desired terminal situation 
(Situation 17 ), for one plan for mechanizing the operation. Explanations, 
and the table of a machine which will perform the operation according 
to this plan, will follow. The '‘ 93 !”, " 93 2 ”, “333”, * 31 ” at the right refer to 
an analysis of this machine as a combination of simpler machines, given 
afterwards. 
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The step from Situation 1 to Situation 2 is an erasure of the scanned 1, 
and a motion once to the left. Then (2 — 3) the machine seeks the first 
blank square at or to the left of the scanned square; (3 — 4) it prints 
there, and goes one more square to the left; (4 — 5) it recognizes (in 
this illustration) and registers that it is now on a blank square, and 
goes one square to the right; (5 — 6) it goes to standard position over the 
number now being scanned. Altogether 1 — 6 move the number as a 
whole one square to the left. Then 6 — 11 repeat this operation. The 
same cycle of steps is started again (11 — 14), but is concluded dif¬ 
ferently (14— 17) after 1 instead of 0 is discovered on the leftmost 
square examined (at Situation 14). 

The reader may verify that the machine defined by the following 
table will carry out the operation in this illustration, passing through 
Situations 1 — 17 and other intermediate situations; and he may con¬ 
vince himself now (or after the next discussion) that it will always perform 
the operation described, starting from any initial situation of the kind 
described. We leave a dash “—” in place of the atomic act order at those 
places in the table where the act ordered is immaterial to us, because the 
configuration is one which will not arise in our use of the machine, i.e. 
when it is started from an initial situation of the kind described. However 
the definition of the machine may be understood to be completed by 
supplying “0” at those places. 


Name of 

Machine 

Scanned symbol 

machine 

state 

0 

1 

93 

1 

— 

EL2 


2 

PL2 

L2 


3 

R4 

R5 


4 

L\ 

R4 


5 

— 

ER6 


6 

P0 

R6 


In discovering this machine, we first analyzed the whole operation we 
wish it to perform into simpler (not necessarily atomic) acts, and then 
further analyzed these into the atomic acts to be performed by the 
machine. We can think of the machine as obtained by connecting together 
several simpler machines which perform one or a succession of the steps in 
the preliminary analysis. We now introduce notations which will be 
useful in describing a machine as a combination of simpler machines. 

If two operations are to be performed successively, and a machine 
X performs the first of them, and a machine ?) the other, then the combined 
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operation will be performed by the machine we get by identifying the 
passive (or terminal) state of 36 with the first active (or initial) state of 
2). The output of 36 thus becomes the input of 9). The resulting machine 
we denote by “36?)”. (Then (369))3 = 36(?)3)-) 

We use “36”” for n > 0 to mean 36 ... 36 (n factors); and “36°” for 
a machine whose table has only 0 as entries, so that 36°?) and 2)36° both 
perform the same operation as 9). 

We may also wish the output of one machine 36 to become the input 
of either a machine f) ora machine 3. depending on some circumstance 
arising during the action of 36. We can provide for this by extending our 
notion of a Turing machine to allow two terminal states ((, and 0 2 . Here 
such 2 -terminal machines will be used only as components in the con¬ 
struction ultimately of machines as defined in § 67 with one terminal 
state. Upon identifying the state 0 X of 36 with the initial state of 9) and 
the state 0 2 of 36 with the initial state of 3> we obtain a machine which 

we denote by “36 j 

We shall now express Machine 33 as a combination of several machines. 
First, we define a machine 93 x which performs the operation illustrated 
by 1 —5 or by 6— 10 or by 11 — 15; i.e. such that, when started 
scanning a number in standard position, it erases the scanned square, 
prints on the first blank square to the left, and goes to state 0 X or 0 2 
scanning that square, according as the square next left of that is blank or 
printed. The leftmost square scanned during the operation is the square 
next left of the terminally scanned square. This machine has the following 
table (cf. Lines 1 — 3 of the table for Machine 93). 


Name of Machine 

machine state 


1 

2 

3 


Scanned symbol 
0 1 

— EL2 

PL3 L2 

R0 1 R0 2 


The operation illustrated by 5 — 6 or 10 — 11 is performed by the 
machine with the table (cf. Line 4 of the table for Machine 93): 

93 2 1 LQ R\ 

To perform 15— 16, similarly we have (cf. Line 5 for Machine 93): 

93 3 1 — ERO 

The operation illustrated by 16— 17 is performed by Machine 21. 
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The action of Machine 95 can be described in terms of 91, 93 1( 93 2 and 
95 3 , thus. We first use 95!, then according as the terminal state is 0 X or 0 2 , 
we use 95 2 or 95 3 . In the case that 95 2 is used, its output is fed back into 93 x . 
In the case that 95 3 is used, its output is fed into 91. We express this by 
the formula 

(a > 8 - {%«. 

where the dots express that the output of 95 2 is fed back as input for 95 r 
The notation is suggested by that for repeating decimals. If the operation 
performed by 93 x each time produced an output with state 0 1( the oper¬ 
ation performed by 95i93 2 would be repeated ad infinitum. This would 
actually happen if Machine 95 were started scanning the leftmost number 
on the tape in standard position; Machine 95 would then keep moving 
this number square after square to the left. 

Formula (a) indicates how to construct the table for Machine 95 from 
those for 91, 95i, 95 2 and 95 3 . In the table for 95 1; we replace the terminal 
states 0 X and 0 2 by the initial states for 93 2 and 95 3 , respectively, re¬ 
numbering these suitably as 4 and 5; etc. (cf. the table for 95 as first given) 

Now let us build a machine (for each fixed w > 1) such that: 
Machine $ m , when started scanning in standard position the represen¬ 
tation of an w-tuple y v ..., y m of numbers, with all squares (or at least 
the first y x -\-2 of them) to the right of the w-tuple blank, copies y x 
following the w-tuple without a gap, and stops scanning the copy in 
standard position. The leftmost square scanned during the action is the 
first (blank) square of the given w-tuple. 

For example, from the initial situation shown next (Situation 1), 
shall reach the situation shown last (Situation 16). A plan for the action 
is indicated by the intervening situations shown. 
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1. 

0 

1 

1 

l 

0 

1 

0 

1 

1 

1 

1 

0 

1 

2. 

0 

1 

1 

i 

0 

1 

0 

1 

1 

1 

1 

0 

1 

3. 

0 

1 

1 

T 

0 

1 

0 

1 

1 

1 

1 

0 

1 

4. 

0 

1 

1 

T 2 

0 

1 

0 

1 

1 

1 

1 

0 

1 

5. 

0 

1 

T 

0 

0 

1 

0 

1 

1 

1 

1 

0 

1 

6. 

0 

1 

i 

0 

0 

1 

0 

1 

1 

1 

1 

0 

1 

7. 

0 

1 

i 

0 

0 

1 

0 

1 

1 

1 

1 

0 

1 

8. 

0 

1 

T 

0 

0 

1 

0 

1 

1 

1 

1 

0 

1 

9. 

0 

1 

T 2 

0 

0 

1 

0 

1 

1 

1 

1 

0 

1 

10. 

0 

1 

0 

0 

0 

1 

0 

1 

1 

1 

1 

0 

1 

11. 

0 

1 

0 

0 

0 

1 

0 

1 

1 

1 

1 

0 

1 

12. 

0 

1 

0 

0 

0 

1 

0 

1 

1 

1 

1 

0 

1 

13. 

0 

1 

0 

0 

0 

1 

0 

1 

1 

1 

1 

0 

1 

14. 

0 

T 1 

0 

0 

0 

1 

0 

1 

1 

1 

1 

0 

1 

15. 

0 

1 

l 

T 

0 

1 

0 

1 

1 

1 

1 

0 

1 

16. 

0 

1 

l 

l 

0 

1 

0 

1 

1 

1 

1 

0 

1 


10 0 0 
10 10 
10 10 
10 10 
10 10 
10 10 
10 11 
10 11 
10 11 
1 0 1 l 

l o i T 
10 11 
10 11 
10 11 
10 11 
10 11 


0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
1 0 
1 0 
1 0 
1 0 
T o 



Machine (£, according to the plan, when started scanning in standard 
position a number which is followed by a gap, goes right two squares, 
prints and stops there. 

Machine 2), when started scanning in standard position a number 
which is not the leftmost.on the tape, goes left to standard position on 
the next number to the left. Hence 2> m , when started scanning in standard 
position a number to the left of which at least m numbers occur, goes left 
passing over m — 1 intervening numbers to standard position on the m -th 
number to the left. 

Machine ©, when started in standard position on a number, decides 
whether that number is 0 or greater than 0, and assumes state 0 X or 0 2 
according to which is the case, with the number still scanned in standard 
position. The leftmost square scanned during the operation is the square 
next left of the scanned square. 

Machine when started scanning a printed square, erases and goes 
left one square. 

Machine & goes one number (& m goes m numbers) to the right, just 
as 2) goes one number (2) m goes m numbers) to the left. 

Machine §, when started scanning in standard position a number 
which is not the rightmost on the tape, fills up the gap (if any) between 
that number and the next one to the right, increasing the first number by 
the number of squares that constituted the gap, and stops scanning the 
resulting number in standard position. 
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If we can find machines (E, ®, § as described, then Machine 

will be given by the formula 

(b) 3. = { |®Ijj 

The operation performed by is repeated, so long as the ap¬ 

plication of Gs gives state 0 2 ; but when it gives 0 1 , the action is terminated 
by !g® m . 

We give tables for ®@ and §, leaving those for g, g and © to the reader. 


Name of 

Machine 

Scanned symbol 

machine 

state 

0 

1 

$ 

1 

L2 

LI 


2 

L2 

0 

e 

1 

— 

L2 


2 

R0 1 

R0 2 

§ 

1 

— 

R2 


2 

PR2 

L3 


3 

— 

EL0 


By several applications of with suitable values of m, we can copy 
without gaps any permutation (allowing repetitions) of the numbers 
represented rightmost on the tape without gaps between. For example, 
from the initial situation 

01 1 10101 1 1 1 0 1 I 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0, 

the machine $ 4 (= (^ 4 ) 4 ) reaches the terminal situation 

011101011110110111010111101T00, 

the whole quadruple being copied; and reaches 

01 1 10101 1 1 101 101 1 1 0 1 T 0 0 0 0 0 0 0 0 0, 

the first and fourth numbers being copied. 

The machine 51 m defined thus 
(c) 51’ m = 

when started in standard position on an m-tuple y v ..., y m , with all 
squares (or at least the first y x +... +y m -\-2tn-\- 1) to the right blank, 
copies the m-tuple to the right after leaving a one-square gap, and stops 
scanning the copy in standard position. The leftmost square scanned 
during the' action is the first (blank) square of the given m-tuple. For 
example, from the above initial situation, 5? 4 reaches the terminal sit¬ 
uation 

011101 0 1111011001110101111017 0. 
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We now wish a machine 2 which, started scanning a number in standard 
position, will erase all numbers (if any) to the left of this up to the first 
gap encountered, and return to standard position on the number originally 
scanned. The leftmost square scanned in the operation shall be the square 
next rightmost in the group of consecutive blank squares constituting the 
gap. For example, from the situation 

001110111101101011 T 0 , 

Machine 2 reaches the terminal situation 

00000000000000001 iTo. 

The construction of 2 is left to the reader. 

Proof of Theorem XXVIII for 1 = 0. The proof is by an induction 
based on Corollary Theorem XIX § 63. There are six cases, according 
to which one of the schemata (I) — (VI) is applied last in the given 
definition of <p by these schemata. For each case, we must show how to 
construct a machine 2J? 9 which 1/1 computes <p. 

To describe the initial situation in the computation of <p(x u . ..,*„) 
for a given n-tuple x v . . ., x n we now write: 

1 . , Xj, V » • , Xfi! 

Here each “ x stands for **+ 1 consecutive printed squares, the commas 
stand for blanks forming part of the representation of the w-tuple, and 
the bar indicates that the representation of x n is scanned in standard 
position. Under the definition of computability (§ 67), all other squares 
to the left and right are also blank. The terminal situation, which 
must reach if cp(x 1( .. ., x„) is defined, we may write similarly: 

,x v ...,x n , cp(x v ...,x n ), 

The first comma here and in the abbreviation for the initial situation 
(Situation 1) refer to the same square of the tape, and here again all 
squares not indicated are blank (for (iii) § 67). When started from 
Situation 1, shall stop eventually, only if <p(x 1 , . . . ,.r„) is defined (for 
(iv)). From Situation 1, 9J? 0 may go arbitrarily far to the right, but must 
never scan a square to the left of the one represented here by the first 
comma (for (ii)). 

Case (I) (Schema (I)): <p(x) = V. Let S» 9 = 

Case (II): <p(x x , .... x n ) = q. Let 9)? 9 = (E9K 
Case (III): y(x v ..., x„) = Let 91^ = 

Case (IV): cp^, ..., x „) ~ • • •. *«)> • • •. X ».(*v ■ ■ ■> *«))• 

By the hypothesis of the induction, there are machines • • •. Wy. m , 9)?,^ 
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which 1/1 compute Xi. • • •, Xm» <J>, respectively. The plan of the computa¬ 
tion of (?{x v ..., x n ) is as follows. Starting from Situation 1, we copy the 
n-tuple x v ..., x n with a one-square gap (shown by the double comma): 

2. , X-^p . • %n> > x \> ■ ■ m Xnp 

The gap marks the beginning of a temporary record, to be erased later. 
Machine $ n performs this copying operation. Machine 9Q£ Xi , if started 
from the situation , x lt .... x n , , will go to the situation 

, x v Xi(*i» •••.*»)> . provided Xi( x i> • • •> *») is defined. In 

performing this operation, 9K Zi does not in any intermediate situation 
scan a square to the left of the first square in , x v ..x n , 
Hence if is started from Situation 2, the presence of the additional 
printing ,x v .... x„, to the left will not affect its action. So 
from Situation 2, 901^ will go to the following, provided Xi( x v •••,*«) 
is defined: 

3. , Xj^, . . . t X n> , X^p • • •, X nJ XlC^li * * • j ^n)j • 

Next we copy x v .... x n , without a gap, by Machine $” +1 , obtaining: 

4. , X^p . . ., X n , t X^p . . ., X nJ Xl(^l> ‘ ‘ > x n)> x x> • • •» Xn, 

Applying 90lx s , if Xz( x i> • • •> x n) is defined, we obtain: 

5. , X j, • • • t x n j , x^ f ..., x nt XiK. •••» %n)t %1> • • • > %n> X 2 (x v .. .,x n ), . 

Continuing in this manner, we eventually obtain the following, provided 
all the function values appearing are defined: 

6. , X-y, . . ., X n , , ...» X n , Xl( x l> • • • > x n)> • • • > x l> • • • i x n> Xmi X l> • • •» x n)> 

Xl{ X l> • • •> x n)> • • •> Xm[ X l> • • ‘< X n)> 4 l (Xl('' ! 'l> * • • > X rt) > • • •> X»»(*l> • • ‘> X n))> • 

By Machine 2 we can then erase every number between the scanned 
number <KXi(*i> ••♦»*«)» • • •< X»»(*i> • • •> *»)) (non-inclusive) and the gap 
(represented by the double comma), after which we can close up the gap 
by Machine S3 to obtain: 

7. , X lt . . ., X n , i>{Xl{ x l> • • •> X n)> • • •> Xn»(*l» • • •> x n))> » i>6. 

, X^p . . ., # n , ^p(^ii • • • ) X n)t 

The entire operation, when <p(%, ...,#„) is defined, is performed by the 
machine defined thus: 

= M Xl • • • X+i^x* S(m-l}(»+l)+l3(m-2)(n+l)+2' • • 

Conversely, using the property (iv) of the machines 9K Xj , ..., and 
9ft,Machine 90^ applied to Situation 1 will eventually stop, only if all 
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Ol Xl(*l> • • • < X n)t • • • > • • • i X„), 'MXlC^l* • • • > x n)> • • • > Y.m{ x 1> • • • » ^n)) 

are defined, i.e. only if cp^, ..., x n ) is defined. 

Case (V): For n > 1 (Schema (Vb)), cp(0, x 2 , ~ ty(x 2 , ...,x n ), 

<p(y'i x 2 , • • •, Xn) — x(y> ?(y, x 2> x 2 , .... x n ). By the hypothesis 

of the induction, there are machines and which 1/1 compute 
4< and i, respectively. We carry out a series of operations to obtain the 
following situation (corresponding to Situation 6 for Case (IV)), if the 
required function values are defined. At each place where a choice is 
indicated, the upper alternative applies under the condition stated, the 
lower otherwise; i.e. for a given y, the lower alternative is taken on the 
first y choices, and the upper on the y +1 -st choice. 

j y> X 2> • • •» x n> > yi X 2> • • •» Xftr < \ > [ X 2’ • • • > x n)> 

( <k* 2 > if y = o. 

1 0, ty(x 2 , . . ., Xjf), X 2 , . . ., X nt 5^(0, fy(x 2 , • • •) %n)> X 2> ' ' '' X n)t 

J f x(0, • • •, *»), X 2 , .... x„), if y— 1 = 0. 

\ 1, X(^» ty( X Z> * * *> X n)> X 2 » • • •> x n)> X 2> ...» X nt 
X0< X(9< t M*2> ■ . •. *n), *2» • • •» *»), *2> • • •>*»)> 

, o / X0> X(°»+(^ 2 . • • •> X n), x 2< • • •> X n), X 2 , . . ., x n ), if y— 2 = 0. 

5 '“ 2 ’ l 2. 

Then everything between the last number and the gap is erased, and 
the gap is closed up. The entire operation is performed by the machine 
9JL defined thus: 



Slight changes adapt the treatment to the case n = 1 (Schema (Va)). 

Case (VI): y(x lt ~ f*y[x(*i> x n, y)=0] (for n^l; cf. 

Remark 1 end § 67). Handled like Case (V) (but more simply). 

Proof of Theorem XXVIII for l > 0. We now have as an additional 
case that 9 is one, say ip 0 of the assumed functions Y. In this case 9 is 
1/1 computed from Y by the machine having a single active state 
q x with the table entry fq 0 . 

§ 69. Recursiveness of computable functions. In this section the 
notations will be those of § 67 rather than § 68 . 

Theorem XXIX. Every computable partial function 9 is partial 
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recursive. Every partial function 9 computable from l completely defined 
functions Y is partial recursive in Y. 

Proof for 1 = 0. It is possible to represent the tape vs. machine 
situations of a given machine fD? by formal expressions (as indeed we 
shall do for another purpose in §71, where the expressions will be called 
“Post words”). The theorem can then be considered as an application 
of (D2) §62 (extended to partial functions; here E X) . +1 depends only 
on E ai ). 

If details are to be supplied, the following proof is a little more direct. 
Godel numbers are assigned directly to the tape vs. machine situations; 
and instead of giving Godel numbers to deductions or proofs (as for 
Theorem IX § 58 and (Dl) § 62), we simply number the successive situ¬ 
ations reached from a given situation by a given machine as the 0-th, 

1-st, .... >th,_Then if one will read “function” for “partial function” 

and "general recursive” for "partial recursive”, he need not use Chapter 
XII. The proof can be shortened slightly by using Theorem XXVI § 66. 

In establishing the Godel numbering of situations, we describe the 
condition of the tape only relative to the scanned square, instead of 
relative to some fixed square. This suffices for our purpose here, since 
absolute position on the tape does not enter into the definition of com¬ 
putability. (For 1/s computability, absolute position or at least position 
relative to the first square of the representation of x v ..., x n in the 
initial situation does matter, by (ii) and (iii).) 

We begin by assigning a Godel number to the (condition of the) tape 
to the left of any particular square. Let the conditions of the successive 

squares to the left of that square be s Uo , s Ui , s Uj ,_Since only a finite 

number of squares are printed, all but a finite number of the subscripts 
«,• are 0. The Godel number u of the tape to the left of the square shall be 
II />“», i.e. II p'fi where t is any number which exceeds every i for which 

i i<t 

«, ^ 0. Then u itself is such a number; so u = II $*» =--■ n p\ u) *. 

i<u i<u 

A Godel number is assigned similarly to represent the condition of the 
tape to the right of a given square. 

If in a given situation, the Godel number of the tape to the left of the 
scanned square is u, the scanned symbol is s a , the machine state is q,., 
and the Godel number of the tape to the right of the scanned square is 
v, then the Godel number of the situation shall be 2“ • 3" • 5 C • 7’’. 
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Example 1 . The Godel number of the situation 



where all squares not shown are blank is 2 fil ‘ 3 l ‘ 2 S *3 1 -5 4 -7 2 ° -31 . 


If the Godel number of the situation is 2 U • 3° • 5 C • 7®, the configuration is 
(s a , q c ). If c> 0, and the act which the machine performs from this 
configuration is of the form s b Lq d , the Godel number p a c (u, v) of the re¬ 
sulting situation is 

[2 exp n $“>.'] • 3 ( “ )o • 5 d • [7 exp {2 b • n pf <}}; 

i<u i<v 


and similarly if the act is s b Rq d . If the act is s^Cq^, then p a c (u, v) is 
2 u. 36 . 5 d. 7 t> E ac h of these functions p a c is primitive recursive. 

Now p defined as follows is primitive recursive (#F § 45), and has the 
property that, if w is the Godel number of an active (passive) situation, 
then p (w) is the Godel number of the next (same) situation: 


9 (w) = 


Pa,c(No- (»)«) if Mi=a & (: w) t =c 

(a = 0 , c = 1 , ..., k), 

w otherwise. 


Thence we define a primitive recursive function 6 thus: 

f 0 (ze>, 0) = w, 

[ 0 \w, z') = p(0(^, z)). 

If w is the Godel number of any situation, then Q(w, z) is the Godel number 
of the situation after the next z acts, if the machine performs at least z 
acts from the given situation; and Q(w, z) is the Godel number of the ter¬ 
minal situation reached from the given situation, if the machine performs 
< z acts from the given situation. 

Next, for each n > 1 (cf. Remark 1 end § 67), we define a primitive 
recursive function t„ with the following property. If the w-tuple x v ..., x n 
is scanned in standard position, the state is q c , the Godel number of the 
tape to +he left of the representation of x v ..., x n is u, and the Godel 
number of the tape to the right of that representation is v, then t b (x x , ..., 
x n , c, u, v) is the Godel number of the situation. First we define thus: 

Tifa, C, u, v) = 

[2 exp {( n Pb-Pls n exp {2°- II pf% 

i<x 1 i<u i<v 
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Then, for n — 1, 2, 3, ...: 

t„ +1 (* 1; x n+1 , c, u, v) = 

~l(*n+l> c - I. • • *«-!> x' n , C, U, v)) 0 , v). 

When x lt ..., x n is scanned in standard position with state q 1( and 
the tape is blank elsewhere, the Godel number of the situation is 
x n (x v ..., x n , 1, 1, 1). When x v ..., x„, x is scanned in standard position 
with state q 0 , the Godel number of the situation is x n+1 (x v ..., x n , x, 0, u, v) 
for some u and v ; and conversely. 

Now, if 9 is the partial function of n variables computed by the given 
machine 2 K, and x v .... x n is a given w-tuple, then <p(x lt x n ) is defined, 
if and only if there exists a quadruple (z, x, u, v) of numbers such that 
Q(x n {x v 1 , 1 , 1 ), z) = Tn+ifo, ...,x n ,x, 0 , u, v), in which case x 

is the value of 9 (%, Accordingly, 

?(*i> ■ - -,x n ) ~ 

(|a/[ 0 (x n (^i, .. x n , 1 , 1 , 1 ), (^)o) = T n+ 1 (xj, ..., x n , (t) j, 0 , (t ) 2 , (^)a)])x- 
Therefore, by Theorem XVIII § 63, 9 is partial recursive (or if 9 is 
completely defined, by Theorem III § 57, 9 is general recursive). 

Proof for / > 0 . Say e.g. there is one assumed function ^ of one 
variable (i.e. I = m 1 = 1). Now, for each c for which the table entry 
corresponding to q c is of the form lq d , we replace “p a>c ((“')o >{ w )z) if 
(w) 1 =a & (u>) 2 =c (a — 0 , in the definition of p (w) by “p c (w) 

if Q c (w)”, where Q c is the primitive recursive predicate and p c the function 
primitive recursive in ^ defined thus: 

QM m (Ey) y<w {Eu) u<w {Ev) v<w [w=c, u,v)}, 

P c(w) = t 2 (Y, d, U, V) where 

Y = (Ay 1 /<u ,(£M) w<u ,(£:^ <U) [a'=T 1 (y, c, u, »)], 

U = \iu u<w {Ey) y<w {Ev) v<w [w=x 1 {y,c,u,v)], etc. 

Theorem XXX (= Theorems XXVIII + XXIX). The following 
classes of partial functions are coextensive, i.e. have the same members: 
(a) the partial recursive functions, (b) the computable functions, (c) the 1/1 
computable functions. Similarly with l completely defined assumed functions 
X F. 


§ 70. Turing’s thesis. Turing’s thesis that every function which 
would naturally be regarded as computable is computable under his 
definition, i.e. by one of his machines, is equivalent to Church’s thesis by 
Theorem XXX. We shall now examine the part of the evidence for it 
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which pertains to the machine concept, i.e. what we listed as (C) in § 62. 
What we must do is to convince ourselves that any acts a human com¬ 
puter could carry out are analyzable into successions of atomic acts of 
some Turing machine. 

“The behavior of the computer at any moment is determined by the 
symbols which he is observing, and his 'state of mind’ at that moment.” 
The number of symbols which he can recognize is finite. "If we were to 
allow an infinity of symbols, then there would be symbols differing to 
an arbitrarily small extent.” (Turing 1936-7 pp. 249—250.) The work 
leading from the problem statement to the answer must be carried out 
in some “symbol space” (Post 1936 ), i.e. some systematic arrangement of 
cells or boxes, each of which may bear (an occurrence of) a symbol. 
There is a finite bound to the number of occurrences of symbols (or of 
boxes where a symbol may occur) which he can observe at one moment. 
He can also remember symbols previously observed, by altering his state 
of mind. However “the number of states of mind which need to be 
taken into account is finite. ... If we admitted an infinity of states of 
mind, some of them will be ‘arbitrarily close’ and will be confused.” 
(Turing 1936-7 p. 250.) But the computer’s action must lead from a quite 
discrete object, namely the symbol array representing some natural 
number (or »-tuple of natural numbers) as argument (s), to another such 
object, namely the symbol array representing the corresponding function 
value. The possible states of mind are fixed in advance of naming the 
particular argument (s), as we are considering computation by a preas¬ 
signed method, and do not allow mathematical invention in the midst 
of the computer’s performance. Each act he performs must constitute a 
discrete change in the finite system consisting of the occurrences of 
symbols in the symbol space, the distribution of observed squares in this 
space, and his state of mind. 

These limitations on the behavior of the human computer in computing 
the value of a number-theoretic function for given arguments, by follow¬ 
ing only preassigned rules, are of the same kind as enter in the con¬ 
struction of a Turing machine. The tape is the symbol space for the ma¬ 
chine, and the machine state corresponds to the computer’s state of mind. 

The human computer is less restricted in behavior than the machine, 
as follows: (a) He can observe more than one symbol occurrence at a 
time, (b) He can perform more complicated atomic acts than the machine, 
(c) His symbol space need not be a one-dimensional tape, (d) He can 
choose some other symbolic representation of the arguments and function 
values than that used in our definition of computability. 



378 


COMPUTABLE FUNCTIONS 


CH. XIII 


We shall examine various possibilities under (a) — (d), and see briefly 
how each can be reduced to an equivalent in terms of Turing machines. 
We shall usually speak as though only one were being reduced, but our 
methods would serve to reduce any combination of them successively. 

Under (a), we remark that e.g. 17 and 21 and 100 can each be ob¬ 
served in a single act. But a long sequence of symbols can only be 
observed by a succession of acts. For example, we cannot tell at a glance 
whether 157767733443477 and 157767733443477 are the same; “we 
should have to compare the two numbers figure by figure, possibly 
ticking the figures off in pencil to make sure of their not being counted 
twice.” (Turing 1936-7 p. 251.) 

If 17 and 21 and 100 are not only observed as units but manipulated 
as though each occupies a single cell of the symbol space, we need only 
redefine the symbols so that each of these constitutes a single symbol, 
in order to reduce the compound observation to a simple observation of 
the kind used by a Turing machine. 

In actual computing we sometimes use certain marks (accent, check, 
movable physical pointer, etc.), which may be placed on a given square 
in addition to an ordinary symbol. If there are j of the ordinary symbols, 
and n of these special marks, any subset of which may be placed on a 
given square, the number of the square conditions is merely increased 
from /+1 to (/+ 1 ) • 2 *. 

As another example of behavior involving compound observation, 
suppose that the following sequence of symbols is printed, 

...4401385789264..., 

that the observer's attention is centered at the figure 7 near the middle, 
and that he observes clearly at most five figures centered at this 7; thus 
the sequence of the five digits 85789 together with his state of mind 
determine his next act. Some digits further off may be vaguely observed, 
but without affecting his act. The act shall be of one of the kinds per¬ 
formed by Turing machines, with each separate symbol occurrence (not 
groups of five) occupying a square. For example, if the next act is 0Lq d , 
the printing becomes 

...4401385089264..., 

with 38508 observed. Such behavior can be reduced to Turing machine 
behavior as follows. Say that the symbols are the ten Arabic digits. 
The behavior can be considered as that of a generalized Turing machine, 
in which the configuration (determining the act) is ( e, /, a, g, h, q e ) 
where e, f, a, g, h are the digits occupying the five squares centered at 
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the scanned square. Corresponding to each state q c of this generalized 
machine, we introduce a set of 10 4 states q ce/eA (e, f, g, h = 0 , ..., 9 ); and 
we modify the table so that upon reaching state q c , a series of Turing 
machine acts is performed, consisting of inspections of the two adjacent 
squares on each side, leading to state q cefgh when the four squares in 
question are occupied by the respective digits e, /, g, h. Not only the 
states q cefgh must be added, but also some states to be assumed during 
the action leading from q c to q eefgh . Details are left to the reader. Now 
the act the generalized machine performed from the configuration 
(e, /, a, g, h, q c ) shall be performed from the configuration (a, q„ /flA ). 
This reduction is an illustration of the remark that one can remember 
a finite number of previously observed symbols by having changed one's 
state of mind when they were observed. 

It might be thought that the printing on still other squares may 
constitute part of the observation, e.g. that on certain specially marked 
squares (finite in number). If these squares are so located in the symbol 
space that the computer can find them and return by acts of the kinds 
performed by Turing machines (cf. the discussion of (c) to follow), this 
kind of corppound observation can be reduced in a similar fashion to the 
preceding. 

Under (b), the computer can alter other squares besides the scanned 
square. The new observed square need not be adjacent to the original. 
However there is a finite bound to the complexity of the act, if it is to 
constitute a single act of the computer. More complicated acts will require 
renewed motivation by reference to the observed data and the state 
of mind at intermediate situations between the given and resulting ones. 
(Indeed it can be argued that theTuring machine act is already compound; 
and consists psychologically in a printing and change in state of mind, 
followed by a motion and another change of mind. Post 1947 does thus 
separate the Turing act into two; we have not here, primarily because it 
saves space in the machine tables not to do so.) 

All simple alterations of the situation, not in the Turing machine form, 
which are readily proposed, e.g. printing after motion instead of before, 
are easily expressed as successions of the atomic acts of a Turing machine. 
(Much more complicated operations, which could hardly be regarded as 
single acts, have already been so treated in § 68 .) 

Turning to (c), computing is commonly performed on 2-dimensional 
paper, and the 2 -dimensional character of the paper is sometimes used 
in elementary arithmetic. Theoretically, we must also consider the pos¬ 
sibility of still other kinds of symbol space. The symbol space must be 
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sufficiently regular in structure so that the computer will not become lost 
in it during the computation. 

From a given square or cell of the space, there will be a finite number 
m +1 of ways of moving to the same or an adjacent cell, call them 
M 0 , ..., M m where M 0 is the identical motion. For example, in the plane 
ruled into squares, m — A (no motion, left, up, right, down), or if diagonal 
motions are also allowed, m = 8 . The computer, whose act from a given 
situation must be determined by which one of a finite number of con¬ 
figurations is existing, could not use more. We lose no generality in 
supposing that there are the same number of directions of motion from 
every cell; in case there are fewer from some cells, the terminus of the rest 
of the ra-j -1 motions may be defined to be the given cell, i.e. these as well 
as M 0 may be taken to be identical. 

The number of cells which can ultimately be reached is therefore 
countable. The same cell may be reached by different successions of 
motions, e.g. in the plane, down and then right leads to the same square 
as right and then down. 

We shall suppose that an enumeration without repetitions can be 
given of all the cells, such that the following is the case. To each of the 
ways of moving M ( (i ±= 0, .. m), there is a computable function (i,- 
such that, if a is the index in the enumeration of the given cell, then 
[Li(x) is the index of the cell reached by the motion M t . This supposition 
is realized by any readily imagined symbol space. 

Using this enumeration, let the cell numbered x in the enumeration 
(x = 0, 1,2, ...) correspond to the x-th. square counting rightward from 
a certain square (called the 0 -th) on a linear tape. 

Using methods from § 68 , we can set up a Turing machine which will 
find the fi. t (*)-th square, when started on the x-ih square, if a dis¬ 
tinguishing mark is kept on the 0-th (or - 1 -st) square. The computation 
for this purpose can be done by marking squares with accents, afterwards 
erased, without interfering with the printing already on them. This 
enables us to reduce computation in the given symbol space to com¬ 
putation on the linear tape of a Turing machine. 

For this reduction, we did not assume that from any cell adjacent to 
a given cell one of the motions returns us to the given cell, i.e. that every 
motion in the space has an inverse. This would be the case in any or¬ 
dinary symbol space. An exception is represented by the computer who 
receives a signal at intervals by ear. 

The symbol space may consist of several disconnected subspaces, 
each having its own scanned cell, as e.g. in the case of a computer who 
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simultaneously reads a symbol on a paper by eye, reads another in braille 
on a tape by hand, and receives a signal by ear. If there are r such 
subspaces, we can reconstrue the cells to be the r-tuples consisting 
of a cell from each of those respective subspaces. 

In regard to (d), we may argue that a natural number y is given in the 
original sense (§ 6 ), only if some sequence of y +1 objects, say y +1 
tallies, is given; and hence that a procedure for computing a function 9 
from its argument (s), when both are expressed in some other notation, 
would not solve the computation problem for 9 , unless the computer 
can also proceed from the other notation for a number y to the sequence 
of y +1 tallies, and vice versa. 

According to our other arguments, Turing machines could then be 
built which, given the other notation for y would supply the y +1 tallies, 
and vice versa. Details can be arranged as in the definition of compu¬ 
tation within one system of notation. Thus for decimal notation, the 
first machine started in the first of the following situations would go 
to the second. 
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For the familiar systems of notation, such as the dual or decimal, the 
existence of such a pair of machines can be established. 

We have been defending Turing’s thesis for number-theoretic functions; 
but Turing machines apply equally well to expressions in any language 
having a finite list of symbols. By using them as just illustrated for the 
case of converting one notation for a natural number into another, we 
get a direct way of characterizing ‘effective’ operations on expressions 
in such languages, as an alternative to requiring a corresponding number- 
theoretic function under a particular effective Godel numbering to be 
general recursive or computable (§61). The method extends to languages 
having an enumerable infinity of symbols, whenever the symbols can 
be considered effectively as composed in turn from the symbols of some 
finite list; e.g. to the formal number-theoretic symbolism, by regarding 
the variables a, b, c, ... as a, a,, a„, ... (§§16, 50). 
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*§ 71. The word problem for semi-groups. Church’s original 
example of a decision problem which, on the basis of his thesis, is un- 
solvable was constructed in terms of X-def inability ( 1936 ). The corre¬ 
sponding example given above (Theorem XII § 60) is constructed in terms 
of general recursiveness (following Kleene 1936 , 1943 ). Turing 1936-7 gave 
examples constructed in terms of computability. This can be done e.g. 
as follows. A machine 2JI is determined by its table, containing {j+l)k 
entries. We can set up a Godel numbering of the tables, and thus of the 
machines. Then the function £ defined as follows is not computable: 
£(x) = 0, if x is the Godel number of a machine W x , and the partial 
function cp x of one variable which W x computes has 1 as value for x as 
argument, and £(. x ) = 1 otherwise. So by Turing’s thesis (or via the 
equivalence of general recursiveness and computability, by Church’s 
thesis) there is no algorithm for deciding whether any given number y 
is the Godel number of a machine which, when started scanning x in 
standard position with the tape elsewhere blank, eventually stops 
scanning x, 1 in standard position. As another example, there is no al¬ 
gorithm for deciding whether any given machine, when started from any 
given initial situation, eventually stops. For if there were, then, given any 
number x, we could first decide whether x is the Godel number of a 
machine ffl x , and if so whether Wl x started scanning x in standard position 
with the tape elsewhere blank eventually stops, and if so finally whether 
x, 1 is scanned in standard position in the terminal situation. 

These first examples of decision problems proved unsolvable are 
problems arising directly in connection with one of the mathematical 
notions (X-definability, general recursiveness, or computability) originally 
identified with effectiveness by the Church-Turing thesis. 

A second class of examples, a step removed from these, are the 
decision problems for certain formal systems, e.g. Theorem 33 §61 
(also cf. § 76, Church 1936 p. 363). 

A decision problem proved unsolvable by Post 1947 and Markov 1947 
is of interest as constituting the first example in which an existing problem 
from outside the field of logic and foundations has been treated. 

The problem which Post and Markov prove unsolvable was proposed 
by Thue 1914 . Suppose that a finite list a 1( ..., a m (m > 1 ) of distinct 
symbols is given; let us call them letters, and the list of them the alphabet. 
A finite sequence of zero or more (occurrences of) the letters, we call a 
word (in, or formed from, that alphabet); in the language of Chapter IV 
(§ 16), a word is simply a formal expression, when a x , ..., a m are the 
formal symbols, except that now we always include the empty expression. 
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A word C is part of another word D, if D is of the form UCV where U 
and V are words (possibly empty). 

Now suppose further that a finite list (A x , Bj), ..(A„, B„) ( n ;> 1 ) 
of pairs of words is given; we call this list the dictionary. We say that 
two words R and S are immediately equivalent (by the given dictionary), 
if R and S are of the respective forms UA f V and UB,V, or of the respective 
forms UB,V and UA,V, for some words U and V and pair (A,-, B f ) 
(1 < i <, n) ; in other words, if R is transformable into S by replacing a 
part A i by its correspondent B t in the dictionary, or inversely. We 
call two words P and Q equivalent (by the given dictionary), if there is a 
finite sequence R 1( . .., R t (/ ;> 1 ) of words such that R x is P, and Rj is Q, 
and R (-1 is immediately equivalent to R, (t = 2, ...,/). 

Thue’s (general) problem is to find an algorithm for deciding, for any 
given alphabet and dictionary, whether any two given words are equiv¬ 
alent. The problem is also known as the word problem for semi-groups. 

Theorem XXXI. The word problem for semi-groups is unsolvable ; 
in fact, there is a particular alphabet and dictionary, such that there is no 
algorithm for deciding whether any two words (formed from that alphabet) 
are equivalent (by that dictionary). (Post 1947 , Markov 1947 .) 


Proof. Our method of proof will consist in picking an alphabet and 
a dictionary, such that, if we had a decision procedure for the equivalence 
of any two words, we could thence obtain a decision procedure for the 
predicate (£; )T x (x, x, y), contradicting Theorem XII §60 (based on 
Church’s thesis). The unsolvability of Thue’s problem for the case of this 
particular alphabet and dictionary of course implies the unsolvability of 
the general problem. 

It is convenient now to consider the equivalence relationship between 
two words in terms of the possibility of formally deducing the second 
word from the first by use of 2 n rules of inference, as follows (i — \, ... ,n): 
, , UA,V UB,V 

( a ) tttTtT' ( b ) 


UB,V, 


UA t V, 


where U and V are any words (possibly empty) in the alphabet a x ,..., a m . 

We shall first study semi-Thue systems in which only the n rules of 
inference (a) are admitted, but not their inverses (b). 

We shall describe a method by which, given any particular Turing 
machine, we can set up a semi-Thue system such that the tape vs. 
machine situations are represented by words of the semi-Thue system, 
and the atomic acts of the machine correspond to applications of the 
rules (a). 
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The alphabet of the semi-Thue system shall consist of s 0 , ..., s, 
(representing the square conditions for the machine), q 0 , (rep¬ 

resenting the machine states), and an additional symbol h (/+£+3 
letters altogether). 

In a given tape vs. machine situation, say that the smallest unbroken 
piece of tape containing all printed squares and the scanned square consists 
of r squares (r > 1 ). Let the conditions of these squares from left to 
right be s ffl , ..., s Br . Let the p-th of these squares be the scanned square 
(1 < p <L r). Let the machine state be q c . Then the situation shall be 
represented by the word 

hs ffl ... s ap T ^up - i • • ■ 

which we call the Post word for it. (When is a word a Post word?) 

Example 1 . The Post word for the situation of Example 1 § 69 
is hs 1 s 1 s 3 s 1 q 4 s 0 s 1 h. 

The n rules of inference (a) of the semi-Thue system shall comprise 
one or more rules corresponding to each of the (;’+\)k active configu¬ 
rations of the Turing machine. If the table entry for an active con¬ 
figuration (s a , q c ) is of the form s 6 Zq d with b ^ 0 , there shall be j- j-2 
corresponding rules as follows (e = 0 , ...,/), 

Us e s a q c V Uhs a q c V 

Us e q d s 6 V, Uhs 0 q d s 6 V; 

if the table entry is s 0 Lq d , there shall be (;+ 2) 2 rules as follows 

(<?,/ = 0, ..., j), 

Us t s g q e s/V Uhs a q c s / V Us e s„q c hV Uhs a q c hV 

Us.q^SoS^V, Uhsoq^S/V, Us e q d hV, Uhs 0 q d hV. 

Similarly there shall be /+2 rules, if the table entry is of the form s b R<\ d 
with b ^ 0; and (7 -f-2) 2 , if of the form s 0 Rq a . If the entry is of the form 
Si,Cq d , there shall be one rule, as follows, 

Us a q c V 

Us 6 q d V. 

In these n rules the A/s are all distinct. Given the Post word for an 
active situation, exactly one of these n rules (a) is applicable to it as 
premise, namely the rule or one of the rules corresponding to the con¬ 
figuration (si, q c ) in that situation, and in only one way, i.e. with only 
one choice of the U and V. (For a rule in which the first symbol of the 
A,- is h, the U will always be empty.) The application of the rule gives as 
conclusion the Post word for the situation resulting by the atomic act 
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of the machine from the given situation. Given the Post word for a passive 
situation, none of the n rules is applicable. 

Hence a given word Q is deducible from a given Post word P in the 
semi-Thue system, if and only if Q is the Post word for a situation which 
the Turing machine will reach from the situation represented by the 
Post word P. 

The partial recursive function 0 • \iyT\{x, x, y) is defined and has the 
value 0, if and only if (EyjT^x, x, y) (cf. §63). By Theorem XXVIII, 
there is a Turing machine which 1/1 computes 0-y.yT^x, x, y). We set up 
the semi-Thue system corresponding to this machine. In this semi-Thue 
system, from the Post word for the situation in which a number x is 
scanned in standard position with state q x and the tape is blank elsewhere 
(i.e. the Post word hs x ... s^h with x+1 occurrences of Sj), we can 
deduce the Post word for the situation in which the number pair x, 0 
is scanned in standard position with state q 0 and the tape blank else¬ 
where (i.e. the Post word hs x ... s^s^h with *+1 occurrences of s x 
preceding the s 0 ), if and only if (£y)7' 1 (x, *, y). Since by use of Church’s 
thesis there is no algorithm for deciding whether (EyjT-^x, x, y) (Theorem 
XII), there can be no algorithm for deciding, for any two given words 
P and Q in the semi-Thue system, whether Q is deducible from P, i.e. 
whether Q follows from P by the rules (a). 

To establish the theorem, it remains to extend this result to the full 
Thue system in which the inverse rules (b) are admitted. This is ac¬ 
complished by the following lemma. 

Lemma VII. For the rules (a) corresponding to a given Turing machine 
as described above : If P is a Post word, Q is deducible from P by the rules 
(a) and (b), and Q contains q 0 , then Q is deducible from P by the rules (a) 
only. 

Proof of Lemma VII, by course-of-values induction on the length 
l of a given deduction of Q from P by the rules (a) and (b). Let the de¬ 
duction be Rj, ..., Rj, where R t is P and R ( is Q. The case for / = 1 
is trivial, and we now suppose l > 1. Since P is a Post word, and the 
rules (a) and (b) each preserve this property, each of R x , ..., R t is a 
Post word, and hence contains exactly one occurrence of a q, i.e. of one 
of q 0 , .... q*. Now Rj, i.e. Q, contains q 0 ; and by the choice of the rules 

(a) to correspond to the active configurations of the Turing machine, 
each of the A/s contains q c for some c =£ 0. Hence R f must come from 
R;-i by one of the rules (a). So if there are any applications of the rules 

(b) in the given deduction, the last will be in the step from R f _ x to R t 
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for some t < l. Then R (+1 comes from R* by one of the rules (a); but also 
since the rules (b) are the inverses of the rules (a), R f-1 comes from R, 
by one of the rules (a). But R ( is a Post word, and to such a word at 
most one of the rules (a) is applicable and in at most one way. Hence 
R*_! and R (+1 are the same word. Hence we can shorten the given de¬ 
duction of Q from P by omitting R ( R, +1 . Applying the hypothesis of the 
induction to the shortened deduction of Q from P by the rules (a) and (b), 
we conclude that there is a deduction of Q from P by the rules (a) only. 

It is easily seen that the word problem for semi-groups is equivalent 
via Godel numbering to a problem of the form whether (Ey)R(x, y) is true 
for given x, where R is general recursive. Since our proof of Theorem 
XXXI is by reducing the problem whether (Ey'jT^x, x, y) to the word 
problem, the word problem is of highest degree of unsolvability for 
decision problems of predicates of the form ( Ey)R(x, y) (cf. end § 61 and 
Example 2 § 65). 

Some further results along the same line as Theorem XXXI are 
contained in Markov’s papers 1947 , 1947 a, 1947 b, 1951 , 1951 a, 1951 b. 
Also cf. Hall 1949 and Boone 1951 abstract. 

The definition of equivalence of two words P and Q (in symbols, 
P ~ Q) which we used above in the word problem for semi-groups can 
be expressed inductively thus, where (extremal clause) P ^ Q only 
as required by the following (direct clauses): 1 . A* ~ B f (* = 1, .. n). 
2 . U ~ U. 3. If U ~ V, then V ~ U. 4. If U ~ V and V ~ W, then 
U ~ W. 5—6. If U ~ V, then Ua* ~ Va,- and a f U ~ a 4 V (» = 1, .. *n). 
Turing 1950 * shows that the word problem for semi-groups with cancellation, 
which we obtain by adding the two following clauses, is likewise un- 
solvable: 7—8. If Ua* ~ Va* or a,U ~ a^, then U ~ V (i = 1, . • rn). 
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THE PREDICATE CALCULUS AND AXIOM SYSTEMS 

§ 72. Gddel’s completeness theorem. We resume the study of 
the predicate calculus, continuing from the point reached in § 37. Say 
that F is a predicate letter formula containing free only the distinct 
variables z v ..., z q (q ^ 0) and containing only the distinct predicate 
letters P x , ...,P, (s ^ 1). An assignment of objects z v from 

some non-empty domain D as values to z x , ..., z„ and of logical functions 
P v ..., P, over D as values to P x , ..., P, we say satisfies F (or is a sat¬ 
isfying assignment to z v ..., z q , P x , ..., P, for F), if under the valuation 
rules given in §§ 28, 36 and 37 F then takes the value t. As defined in 
§ 37, F is satisfiable {valid) in a non-empty domain D, if some (every) 
assignment to z v ..., z„ P x , ..., P, in D satisfies F. As to the 
notation, we are now writing the logical functions “Pj{a v . ..,<*«,)”> 
"A{a, b )”, etc., instead of ‘T x (a x , ..., a W/ )”, ”\{a, b)”, etc. as we did in 
§§ 36, 37. In the case that the domain is the natural numbers, logical 
functions are simply number-theoretic predicates, when we do not 
make a distinction between propositions and truth values t or f (cf. 
(b) § 45 and remarks there); and indeed we shall sometimes call them now 
predicates. 

Theorem 34 oC . If a predicate letter formula F is irrefutable (i.e. if 
~iF is unprovable, § 41) in the predicate calculus, then F is satisfiable in 
the domain of the natural numbers. (Godel’s completeness theorem for the 
predicate calculus, 1930 .) 

Modifications of Godel’s proof appeared in the 2 nd (1938) edition of 
Hilbert-Ackermann 1928 and in Hilbert-Bernays 1939 . Henkin 1949 gave 
a proof employing a minimum of knowledge of the deductive properties 
of the predicate calculus. We give a proof which is intermediate in this 
respect between Hilbert-Bernays’ and Henkin’s. There is also a proof by 
Rasiowa and Sikorski 1950 using algebra and topology. 

Proof of Theorem 34 (preliminaries). By Theorem 19 §35, any 
predicate letter formula F is equivalent to a prenex predicate letter 
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formula, which (by the method of proof) has the same distinct free 
variables z x , .. z q and predicate letters P 1( ..., P s as F. By Theorem 
21 § 37 and the valuation table for ~ § 28, or by paralleling the proof of 
Theorem 19 set-theoretically, the prenex form of F is satisfied by a given 
assignment z v ..z v , P x , .. ., P s , if and only if F is satisfied by it. 

For the remainder of the proof (including Lemmas 22 and 23), we 
shall assume that F is prenex. For illustration, suppose that F is 
Vx x 3y x Vx 2 Vx 3 3y 2 Vx 4 B(z x , z 2 , x v y x , x 2 , x 3> y 2 , x 4 ) where B(z x , z 2 , x x , y x , 
x 2 , x 3 , y 2 , x 4 ) contains no quantifiers and only the distinct variables 
shown (so q = 2). 

Instead of speaking of the value which a predicate letter formula 
A(u x , ..., up) takes when its free variables and predicate letters u x , ..., Uj,, 
P x , ..., P s take respectively the values w x , ..., u v , P x , ..., P s , it will 
usually be more convenient notationally to permit the substitution of 
numerals for the free variables of predicate letter formulas, and then to 
speak of the value taken by A(u x , ..., u„) (where u x , .. ,,u v are the 
numerals for the natural numbers u v . .., u p ) when P x , ..., P 3 take the 
values P x , . . P s . (Here A(u x , ..., **„) is a predicate letter formula with 
numerals in the sense obtained by extending the notion of ‘^-predicate 
letter formula’ § 37 to allow all numerals and not simply 1, ..., k (and 
variables) as terms. A predicate letter formula with numerals containing 
no variables free or bound is a proposition letter formula with numerals.) 

We can then treat the problem of choosing the logical functions 
P x , ..., P s as a problem of choosing a value (t or f) for each of the formulas 
P ; (a x , ..a».) where j — 1 , ..., s and a v .. ., a n , range over all n r 
tuples of natural numbers. Let these formulas be enumerated in some 
manner without repetitions as Q 0 , Q x , Q 2 , .... 

By the valuation rules for V and 3, F is satisfied by z v z 2> P x , ..., P„ 
if for each natural number x lt there is some natural number y x depending 
on x x (write it “yi(*i) ”)< suc h that for each x 2 and x 3 , there is some y 2 
depending on x v x 2 , x 3 (write it “y 3 (x x , x 2 , x 3 )”), such that for each x it 

(I) B(z x , z 2 , x v y^xj, x 2 , x 3 , y 2 (x v x 2 , x 3 ), x 4 ) 

(where,y x (x x ) is the numeral for the natural number y x (x x ), etc.) has the value 
t. We now takez x , z 2 , y x (# x ), y 2 (x x , x 2 , x 3 ) to be 2° • 3 1 , 2° • 3 2 , 2 1 • 3 Xl , 2 2 • 3 Xl • 
5 12 • 7*h respectively. Thus we determine an infinite class F 0 of proposition 
letter formulas with numerals (namely the formulas (I) when x v x 2 , x 3 , x i 
range over all quadruples of natural numbers, and z v z 2 , yx(x x ), y 2 (x x , 
x 2 , x 3 ) are as just specified) such that if P x , ..., P s jointly satisfy these 
formulas, i.e. give them all the value t, then z v z 2 , P x , ..., P, satisfy F. 
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We say that a class of formulas is consistent in a formal system S, if 
the formal system obtained by adjoining the formulas of the class as 
axioms to the postulates of S is simply consistent (§ 28), i.e. if for no 
formula A are both A and -i A deducible in S from formulas of the class. 

The proof of Theorem 34 will be completed by Lemmas 22 and 23, 
which relate to any prenex predicate letter formula F and the class 
F 0 obtained from it as illustrated. 

Lemma 22° c . If F 0 is consistent in the propositional calculus, then F is 
satisfiable in the domain of the natural numbers. 

Proof. By the preliminaries, it will suffice to show that the formulas 
of F 0 can be jointly satisfied under the valuation procedure of the prop¬ 
ositional calculus by an assignment of truth values (t or f) to each of 
Qo> Qi> Q 2 > • • • • (O ur method now will show this for any class F 0 of 
formulas consistent in the propositional calculus and list Q 0 , Q x , Q 2 , 
... of distinct formulas prime for the propositional calculus including 
all components of F 0 prime for the propositional calculus, § 25.) 

Let R 0 be Q 0 or -iQ 0 according as F 0 |- Q 0 or not F 0 |- Q 0 in the 
propositional calculus (where “F 0 b” means deducible from formulas 
of F 0 ), and adjoin R 0 to F 0 to obtain a new class F x . Then F x is also consistent 
in the propositional calculus. For in the first case (i.e. when F 0 (- Q 0 
and R 0 is Q 0 ), the addition of R 0 does not increase the class of the formulas 
which can be deduced; and in the second case (i.e. when F 0 b Qo an d 
R 0 is —iQo), if F x f- A and F x f- —iA for some A, then F 0 . iQo f- A 
and F 0 , -iQ 0 \- -iA, and thence by -t-introd. and elim., F 0 b -v-iQ 0 
b Q 0 , contradicting the case hypothesis. 

Similarly for each natural number i, let R* be Qi or -i Qi according as 
F i b Qi or F i b Qi, and adjoin Ri to Fi to obtain F t+1 ; then the con¬ 
sistency of F f+1 follows from that of F i . 

We now assign to Qi the value t or f according as Rj is Qi or Ri is -i Qi. 

For this assignment, not only R 0 , R x , R 2 , ... but also the formulas 
of F 0 take the value t. For let H be a formula of F 0 . The distinct parts of 
H prime for the propositional calculus belong to the list Q 0 , Q x , Q 2 , ...; 
say they are Qi, ..., Qi,. Consider the formula H & Ri & ... & Ri,; call 
it “A”. Let i = l-bmax(* x , ...,»,); then Fi b A, since H, Ri, ..., Ri, 
all belong to Fi. The assigned values are the only ones for which R*, 
..., Ri, are all t. Hence if H were not t for this assignment, then A would 
be identically false, and nA would be identically true (§ 28); so by Theo¬ 
rem 10 § 29 -i A would be provable in the propositional calculus, a fortiori 
Fi b A, which with F; b A would contradict the consistency of Fi. 
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Lemma 23. If F is irrefutable in the predicate calculus, then F 0 is 
consistent in the propositional calculus. 

Proof. Note that by our choice of the numbers z x , z 2 and functions 
y x (x x ) and y 2 (*i. x 2 > x 3 ) '■ (A) The numbers z x and z 2 , y x (x x ) for x x 0, 

1 , 2, ..., and y 2 {x x , x 2 > x z) f° r x i> x i< x z — 0> L 2 , ... are all distinct. 
(B) x x < y x (x x ) and x v y x {x x ), x 2 , x 3 < y 2 (x x , x 2 , x 3 ). 

To show by reductio ad absurdum that F 0 is consistent, suppose that 
for some A, both A and -iA are deducible in the propositional calculus 
(using predicate letter formulas with numerals) from formulas of F 0 . 
Then using weak -i-elim., there is a deduction of the formula 
from the same formulas, i.e. from a finite set of formulas of the form (I). 

Let “0”, “T”, “2”, ... denote variables distinct from each other and 
from x 1; y x , x 2 , x 3 , y 2 , x 4 , but such that z x is z x and z 2 is z 2 . By changing 
each numeral u where it occurs in this deduction not as a part of another 
numeral to the corresponding variable u, we obtain a deduction of 
<£7 & -i <37 in the (predicate letter) propositional calculus, a fortiori in the 
(pure) predicate calculus with all variables held constant, from a finite 
set of formulas of the form B(z x , z 2 , x x , y x (x x ), x 2 , x 3 , y 2 (x x , x 2 , x 3 ), x t ) ; and 
thence by use of V-elim. (noting that x 4 is distinct from the other variables 
shown), from formulas of the form 

Vx 4 B(z x , z 2 , x x , y x {x x ), x 2 , x 3 , y 2 (x v x 2 , x 3 ), x 4 ). 

Thus writing out the distinct assumption formulas (say there are l of 
them), 

Vx 4 B(z x , z 2 , x\, y x [x\), x\, x\, y 2 {x\, x\, x\), x 4 ), 

^ Vx 4 B(z x , z 2 , x\, y x {x f), x\, x\, y 2 (x\, x\, x\), x 4 ), 

Vx 4 B(z x , z 2 , x \, y x (x\), x\, x\, y 2 (x[, x\, x\), x 4 ) b T? & -i €1. 

The formula & -i <3? contains no variables. The variables which 
appear free rightmost in the assumption formulas of (1) are 
y 2 (x\, x\, x\), ..., y 2 (x\, x\, x l 3 ). Since the l assumption formulas are 
distinct, i.e. x[, x{, x{ for j = 1 , ...,/ are distinct triples of natural 
numbers, by (A) these l variables are distinct from each other as well as 
from z x , z 2 . By (B), we can choose one of them, say y 2 (x\, x\, x\), which is 
of greater index (i.e. comes later in the list 0, T, 2, .. .) than any of 
the variables which appear free elsewhere in (1) (i.e. not rightmost) 
except perhaps z x or z 2 , and hence is distinct from those variables also. 
By 3-elirn., followed by a change of bound variables (*74 § 33, noting 
that y 2 is distinct from the other variables shown) and two V-eliminations 
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(noting that x 2 , x 3 are distinct from each other and the other variables 
shown), 

Vx 2 Vx 3 3y 2 Vx 4 B(z 1 , z 2 , x\, y^x]), x 2 , x 3 , y 2 , x 4 ), 

^ vx 4 b(z x , z 2> %i> yi{A)< x l> y^A, x \> x \)> 

Vx 4 B(z 1 , z 2 , x\, yi{x\), x\, x\, y 2 {x\, x\, x l 3 ), x 4 ) h 

No variables are varied, since the variable y 2 (x\, x\, x\) of the 3-elim. 
does not occur in any of the other assumption formulas (cf. Lemma 7b 
§24). 

Again by (A) the variables y^x]), y 2 (xf, x\, x\), .... y 2 (x[, x\, x\) which 
appear free rightmost are distinct from each other as well as from 
z x and z 2 . By (B) we can choose one of them which is of greater index 
than any of the variables appearing free elsewhere in ( 2 ) except 
z x and z 2 , and so must be distinct from those also. This variable may be 
y x (*}) or say y 2 (xf, x\, x\). If it is y x {x\), by 3-elim., *74 and V-elim. we 
replace the first assumption formula by Vx 1 3y 1 Vx 2 Vx 3 3y 2 Vx 4 B(z 1 , z 2 , x x , 
y x , x 2 , x 3 , y 2 , x 4 ), i.e. by F. If it is y 2 (x\, x\, x\), we instead replace the 
second assumption formula by Vx 2 Vx 3 3y 2 Vx 4 B(z 1 , z 2 , x\, y 1 (^j), x 2 , x 3 , y 2 , 
x 4 ), and if this is a duplicate of the first (i.e. if x\ = x\) we omit it. 

We continue in this manner. After each use of 3-elim. (applied to a 
y appearing rightmost and distinct from all the other variables shown) 
and *74, we apply V-elim. to the x’s which are "uncovered” by the 3-elim., 
and then omit the resulting assumption formula, if it is a duplicate of 
another, so that the y’s which appear rightmost at the next stage will 
again be distinct from each other. Eventually we obtain simply 

(3) F H <37&-ie2. 

From (3) by &-elim. and -i-introd., 

(4) b -iF, 

contradicting the hypothesis of the lemma that F is irrefutable. 

Corollary 1 oC . Every predicate letter formula G which is valid in the 
domain of the natural numbers is provable in the predicate calculus (and 
hence, by Theorem 21 §37, is valid in every non-empty domain ). (Another 
version of Godel’s completeness theorem, 1930 .) 

Proof. (G is valid in the domain of the natural numbers} 
-> {~iG is not satisfiable in that domain} {-iG is refutable, i.e. 
~1—1G is provable, in the predicate calculus} [by the theorem, applied 
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with -iG as the F, and contraposed (cf. *14 § 26)] -> {G is provable in 
the same} [by -i-elim]. 

Corollary 2 c . If a predicate letter formula F is satisfiable in some 
(non-empty) domain, then F is satisfiable in the domain of the natural 
numbers. (Ldwenheim’s theorem, 1915 , also called the Lowenheim- 
Skolem theorem.) 

Proof. By contraposing Corollary 1 ; or thus: {F is satisfiable 
in some domain} ->■ {-iF is not valid in that domain} {-iF is not 
provable, i.e. F is irrefutable, in the predicate calculus} [by Theorem 
21 contraposed] ->■ {F is satisfiable in the domain of the natural numbers} 
[by the theorem]. 

Lowenheim’s proof of his theorem, and the simpler proof given by 
Skolem 1920 , employ the set-theoretic axiom of choice (Zermelo 1904 , 
cf. § 13). The proof via Godel’s theorem is non-constructive to a lesser 
degree. The non-intuitionistic step (in the present treatment) occurs in 
the proof of Lemma 22, where we assume that F 4 (- Q, or F ( 1 - Q,. 
By formalizing a part of their proof of Godel’s completeness theorem 
which contains the non-constructive step, Hilbert and Bernays obtain 
a metamathematical completeness theorem for the predicate calculus 
(1939 pp. 252—253), which we shall formulate as Theorem 36. 

Theorem 35 oC . The satisfying predicates P v .. .,P S for F in Theorem 
34 can be chosen so that Pj(a v .... a nj ) = ( Ex)(y)R j (a 1 , .. a njt x, y) 
(x)(Ey)S j (a 1 , ..., a n ., x, y) where R, and Sj are primitive recursive 
(j = 1 , ..s). 

Proof. In the following, “recursive” can mean general recursive 
(though actually the statements hold in the meaning primitive recursive); 
then by Corollary Theorem IV § 57, we can take Rj and in the con¬ 
clusion to be primitive recursive. 

Suppose we have set up a Godel numbering of the proposition letter 
formulas with numerals by the methods of §§ 52 and 56. Then if a and b 
are the Godel numbers of formulas A and B, respectively, that of A D B 
is n(«, b) and of - 1 A is v(a), for certain recursive functions 9 . and v. Let 
H(a)' = {a is the Godel number of a formula belonging to F 0 }; then H is 
recursive. If the enumeration Q 0 , Q 1( Q 2 , ... was chosen suitably, the 
following functions will be recursive: x.(i) = {the Godel number of QJ, 
*j( a v • • •» a n,) = {the i such that P^aj, ..., a n/ ) is Q,}. 

By the definition of the satisfying predicates P ; as given in the proof 
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of Lemma22, if P,(a 1( . is Q,-, then Pj(a v ..., a n f) = {R, is Q<} = 

{F f H QJ- Let F{i, a) = {a is the Godel number of a formula A such that 
F t - 1 - A}. Then 

(a) Pi (a v .. ., a nj ) == F{a. j {a l , ,.., a„ ; ), x(oc f (a v .... a nj ))). 

Now we shall investigate the predicate F(i, a). 

To say that F 0 \- A in the propositional calculus means the same 
as that A is provable in the formal system obtained by adjoining F 0 as 
axioms to the propositional calculus. Let F 0 (a) = {a is the Godel number 
of a provable formula of this system}. By the methods of §§51, 52 (cf. 
Dnl 2 ), since H is recursive, F 0 (a) = ( Ey)R{a , y) with a (primitive) re¬ 
cursive 1 R; i.e. F 0 (a) is expressible in the existential 1 -quantifier form of 
Theorem V Part II § 57. 

By the definition of F i+1 from F, and the D-rules, {F i+1 |- A)s (F,-, 
R, h A} == {F* f- R^ O A}. Taking into account the definition of R< 
by cases, we thus have 

I F(0, a) B F 0 (a), 

\ F(i', a) = [.F(i , x(0) & F(i, p(>c(*), a))} V [F(», x(*)) & F(i, {i(v(x(i)), a))]. 

This shows that F is recursive in F 0 ; for applying #D § 45, and going 
over from the predicates F and F 0 to their representing functions cp and 
<p 0 , we have a "nested recursion” (§ 55), 

r 9 ( 0 , a) = <p 0 (fl), 

\ 9 (i v , a) = x(?(b x(*)), 9 (*, |i(x(»), a)), 9 (*, fx(v(x(f)), «))), 

where x» R> ^ and v are (primitive) recursive. (In fact by the result of 
Peter 1934 cited in § 55, F is primitive recursive in F 0 .) 

By (a), Pj is recursive in F, a„ x; hence since a, and x are recursive, in 
F ; and hence in F 0 , which is expressible in the existential 1 -quantifier 
form. Therefore, by a theorem of Post (Theorem XI § 58), P 3 is ex¬ 
pressible in both the 2-quantifier forms of Theorem V, as was to be shown. 

Theorem 36°. The addition to the postulate list for the predicate 
calculus of an unprcvable predicate letter formula G for use as an axiom 
schema would cause the number-theoretic system as based on the predicate 
calculus and Postulate Group B (§19) to become < 0 - inconsistent (§ 42). 
{In fact, a certain formula would become refutable which expresses a true 
proposition of the form ( y)D{y) where D(y) is an effectively decidable pred¬ 
icate.) (Hilbert-Bernays completeness theorem, 1939 .) 

Note the partial analogy to Corollary 2 Theorem 10 § 29 for the prop¬ 
ositional calculus. 
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Method of proof. Without loss of generality, we can take G to be 
closed, so that q = 0 (cf. end § 32). Let F be a prenex form of -i G, so that 
by Theorem 19 with *30 and *49, 

(i) h G ~ —i F in the predicate calculus. 

By (i), since G is unprovable, so is -iF, i.e. F is irrefutable. 

Hence by Lemma 23 (the proof of which was finitary), 

(ii) F 0 is consistent in the propositional calculus. 

The consistency of F 0 is equivalent to the proposition that F 0 \- d & -i<3. 
If r be the Godel number of d & -i d, then F 0 d tk—\d = (Ey)R(r, y) 
as (y)R{r, y) for the primitive recursive R(a, y) used in the proof of Theo¬ 
rem 35. Let D(y) = R(r, y). Then D(y) is primitive recursive, and (ii) is 
equivalent to 

(»*) (y)D(y). 

By Corollary Theorem 27 § 49, D(y) is numeralwise expressed in the 
number-theoretic formalism by a formula D(y). Then from (ii a ), 

(Hi) (y)[h DM] 

in the number-theoretic formalism. 

According to Lemma 22 , classically, 

{F 0 is consistent in the propositional calculus} -> 

1 {F is satisfied by certain predicates P v ..., P s }. 

The premise of the implication (iv) can be expressed in the symbolism 
of the number-theoretic system by the formula VyD(y). The conclusion, 
can also be expressed in the number-theoretic symbolism, when we use 
the expressions for P v ..., P s given in Theorem 35. Let R^a^ ..., a n ,-, x, y) 
numeralwise express R,(a 1( ..., a nj , x, y). Then 3xVyR,(a x , ..., an,, x, y) 
expresses Pj{a v ...,a nj ) in the symbolism. The proposition that F is 
satisfied by P v ..., P s is then expressed by the formula F* which we 
obtain from F by substituting the formulas 3xVyR,(a 1( ..., a. nj , x, y) 
(j — 1 , ..., s) for the respective predicate letters Pj(a 1( .... a n ,.) (assuming 
that the bound variables have been suitably chosen). Then the im¬ 
plication (iv) is expressed by the formula VyD(y) D F*. 

We now propose that the informal classical demonstration of (iv) 
should be formalized in the classical number-theoretic system as a proof 
of this formula, so that we should then have 

(v) h VyD(y) D F* 
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in that system. By contraposition (*12 §26), 

(vi) |- -iF* ID -iVyD(y). 

From (i) by substitution (Theorem 15 § 34), 

(vii) 1 - G*~-iF*. 

Using (vii) in (vi), 

(viii) f- G* D -iVyD(y). 

In (iii) and (viii), the “b” refers to the unaugmented number-theoretic 
formalism. Now if G be added to this system as an axiom schema, then 
G* becomes an axiom (by the new schema), and then (since (iii) and (viii) 
still hold) D(y) for y = 0, 1 , 2 ,... and —iVyD(y) are provable simul¬ 
taneously, i.e. the augmented system is co*inconsistent. (Thus VyD(y) 
becomes refutable, although it expresses the true proposition (y)D{y).) 

We shall not take the space to carry out the formalization of a given 
informal demonstration called for here in the step from (iv) to (v), just 
as we did not for the step from (I) to (II) in the proof of Theorem 30 § 42. 

Hilbert and Bernays 1939 (pp. 205 ff., especially pp. 243—252) carry 
out the formalization of the corresponding part of their proof of Godel’s 
completeness theorem in another formal system, whence it can be in¬ 
ferred that Theorem 36 holds for ours (cf. the remarks on that system 
preceding Example 9 § 74). 

Theorem 37 oC . Given an enumerably infinite (or finite) class of pred¬ 
icate letter formulas F 0 , F x , F 2 , . .., if every conjunction of a finite number 
of them is irrefutable in the predicate calculus, then they are jointly satisfiable 
in the domain of the natural numbers, by a satisfying assignment of natural 
numbers z 0 , z v z 2 , ... to the distinct variables z 0 , z v z 2 , ... which occur 
free, and of predicates P 0 , P v P 2 , ... to the distinct predicate letters 
P 0 , P x , P 2 , .. . which occur, in formulas of the class. The lists z 0 , z v z 2 , ... 
and P 0 , Pj, P 2 , ... may be finite or infinite. (Godel’s completeness 
theorem for infinitely many formulas, 1930 .) 

Proof. For example, suppose now that for some k, F k is 

Vx fci3yfci Vx fc2 Vx fe3 3 yfc2 Vx fc4 B ( z ^i< z %< x *i.y M . x fc2 > x * 3 - y fc2 . x m)- Then we 

take z ekv z e/t2 , 3 Ti(%i). yic 2 ( x ki> x k 2 > x k:i) to be 2°• 3 ekl , 2°• 3 ek -, 2 1 - 3 k ■ 5 Xkl , 
2 2 • 3 fc • 5 Xkl ■ 7 X * 2 • 11 Xhi . The set F 0 is to be the sum of the sets F 00 , F 10 , F 20 , ... 
formed as was illustrated. 

Corollary 1 oC . If for each assignment in the domain of the natural 
numbers to the free variables and predicate letters of the formulas G 0 , G x , 
G 2 , ..., one of those formulas takes the value t, then some disjunction of a 
finite number of them is provable in the predicate calculus. 
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Corollary 2 c . If F 0 , F x , F 2 , ... are jointly satisfiable in some non¬ 
empty domain (or even if each conjunction of a finite number of F 0 , F x , 
F 2 , ... is satisfiable in a respective non-empty domain (Godel 1930 )), 
then F 0 , F x , F 2 , ... are jointly satisfiable in the domain of the natural 
numbers. (A generalization of Lowenheim’s theorem, Skolem 1920 .) 

Proof of Corollary 2 . {Each conjunction of a finite number of 
F 0 , F 1( F 2 , ... is satisfiable in a respective non-empty domain} -> 
{the negation of each such conjunction is not valid in every do¬ 
main} {each such conjunction is irrefutable} (by Theorem 21 § 37) 

{F 0 , Fj, F 2 , ... are jointly satisfiable in the domain of the natural 
numbers} (by the theorem). 

Theorem 38° c . If the enumeration F 0 , F x , F 2 , ... of formulas in 
Theorem 37 is effective {if infinite), the jointly satisfying predicates P 0 , 
P v P 2 , ... can be chosen so that P 3 (a x , .... a n j) = {Ex){y)R j {a 1 , ..., a n ., 
x, y) = {x){Ey)S } {a 1 , ..., a n ., x, y) where R } and S } are primitive recursive 
{j = 0, 1,2, ...). The hypothesis that the enumeration F 0 , F x , F 2 , ... is 
effective can be made exact by requiring, for a suitable Godel numbering, 
that the Godel number of F fc be a general recursive function of k (cf. 
§61), or by use of Turing machines in the manner suggested at the end 
of §70. 

Proof. Now the class aH(a) (cf. the proof of Theorem 35) is re¬ 
cursively enumerable (but not necessarily recursive). However by taking 
F 0 (a) = ( En)F 0 (a , n) where F 0 (a, n) = {a is the Godel number of a 
provable formula of the system obtained by adjoining the first n of the 
formulas F 0 as axioms to the propositional calculus}, and using (17) § 57 
(or end § 53), we still have F 0 (a) = (Ey)R(a, y) with a recursive R. 

The significance of Godel’s completeness theorem and Lowenheim’s 
theorem (including the versions given in § 73) will be discussed in § 75, 
which may be read without the starred § 74, provided the reader will 
accept a few plausible statements referring to § 74. In § 76 somewhat 
more use is made of § 74. 

If F is deducible from G in the predicate calculus with a postulated 
substitution rule (end § 37), then F is valid in every domain in which G is; 
and hence interdeducible formulas are valid in the same domains. The 
converse is true when only 0 - and 1 -place predicate variables occur, by 
use of theory cited in §76; but not in general, by Hasenjaeger 1950 . 
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§ 73. The predicate calculus with equality. A treatment of 
equality may be combined with the predicate calculus by adding to the 
postulates the following axiom and axiom schema, where x is a variable, 
A(x) is a formula, and a and b are distinct variables free for x in A(x): 
22 . a^a. 23. a=b 3 (A(a) 3 A(b)). 

For the pure predicate calculus with equality, ‘term’ shall mean variable, 
and ‘formula’ shall have the sense of equality and predicate letter formula, 
which we obtain from ‘predicate letter formula’ (§31) by adding to the 
definition a clause which states that, if s and t are terms, s=t is a formula. 
An equality and predicate letter formula containing no predicate letters 
other than the distinct letters P x , ..., P s we call a letter formula in =, 
P x , ..., P„. 

(A) Axiom 22 (which is *100 § 38) and a=b 3 (a=c 3 b—c) (Axiom 
16 § 19, which is now an axiom by Axiom Schema 23) we call the open 
equality axioms for =. The n following axioms by Schema 23, 

a =b 3 (P(a 1; ..., a,.!, a, a i+1 , ..., a„) 3 P(a a , ..., a,_ lf b, a f+1 , ..., a„)) 
(i = 1, ...,«), where P is an n-place predicate letter and a lt 
a.i_ v a, b, a i+1 , ..., a„ are some w+1 distinct variables, we call the 
open equality axioms for P. The closed equality axioms are the closures 
of the respective open equality axioms (with which they are inter- 
deducible, end § 32). By “Eq(=, P 1; ..., P s )” we denote the conjunction 
of the closed equality axioms for — , P x , ..., P,. 

Example 1. If cT takes two arguments, Eq(=, <37) is the formula 
Va[a=a] tk s da'ib'1c[a=b 3 (a=c 3 b—c)] & 

VaVbVc[a=b 3 (<37 (a,c) 3 £l{b,c))]&VaVbVc[a=b 3 (<37(c,a) 3 <37(c,/>))]. 

(B) From the equality axioms for = (Axioms 22 and 16), we can 
deduce in the predicate calculus the reflexive (* 100 ), symmetric (* 101 ) 
and transitive (* 102 ) properties of equality, and both the special re¬ 
placement properties (*108, *109) as in §38. From any axiom a=b 3 
(A (a) 3 A(b)) by Schema 23 with A(x) not containing a or b free (in 
particular, *108, *109 or an equality axiom for a predicate letter P) and 
*101 we can deduce a=b 3 (A(a)~ A(b)) (as we did in a slightly different 
format when we inferred *115 and *116 from *108, *109 and *101). 

Now and in § 75, when we are dealing with a class of equality and 
predicate letter formulas containing predicate letters only from a given 
list, we shall use “Q” to stand for some particular 2 -place predicate 
letter not in the list. Given a formula of the class, call it “E” or “E(=)”, 
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by “E Q ” or “E(Q)” we shall mean the predicate letter formula obtained 
from it by replacing simultaneously each part of the form s=t where 
s and t are terms by Q(s, t). 

The notions of ^-identity and ^-equality (§ 36), and of satisfiability 
and validity in a given non-empty domain, are extended to equali¬ 
ty and predicate letter formulas by providing that a=b shall have 
t as value (f as value) when a and b assume the same (different) 
objects from the domain as their values; or if we have first substituted 
numerals, that a—b have the value t or f according as a = b or a ^ b. 
Thus a letter formula F in =, P x , ..., P, will be satisfied by z v ..., z Q , 
P v ..., P s , if and only if F Q is satisfied by z v ..., z q , Q, P v ..., P, with 
Q(a, b) =- a—b. 

In Godel’s extension of his completeness theorem to the predicate 
calculus with equality ( 1930 ), it is necessary to allow the alternative that 
the domain be finite, since e.g. a^b & Vc (c=a V c=b) is satisfiable in 
and only in a domain of two objects. 

Theorem 39 (oC) . Theorems 20 (§ 36), 21 c (§ 37), 34 oC and 37 oC (§ 72) 
and their corollaries hold reading “equality and predicate letter formula”, 
“predicate calculus with equality”, “satisfiable [valid, each assign¬ 
ment) in the domain of the natural numbers or a [and every, or a) non¬ 
empty finite domain” in place of “predicate letter formula ”, “predicate 
calculus”, “satisfiable [valid, each assignment) in the domain of the 
natural numbers”, respectively. (The theorems thus extended we cite 
using a star “*”; the marks “°” and “ c ” apply to the starred theorem 
when they apply to the unstarred.) 

Proofs. Theorem 34*. Let F be a letter formula in =, P x , ..., P t 
which is irrefutable in the predicate calculus with equality. By (A), 
Eq(=, P x , ..., P s ) is provable in the predicate calculus with equality. 
Hence by *45 § 27, F & Eq(=, P x , ..., P„) is irrefutable in the predicate 
calculus with equality, a fortiori in the predicate calculus (using equality 
and predicate letter formulas). Then by Theorem 15 §34, F Q & Eq(Q, 
P x , ..., P s ) is irrefutable in the pure predicate calculus. Hence by 
Theorem 34, F Q & Eq(Q, P x , ..., P,) is satisfiable in the domain of the 
natural numbers. The proof is completed by (a) of the following lemma. 

Lemma 24°. (a) If F Q &Eq(Q, P x , • •P s ) is satisfiable in a given 

non-empty domain D, then F is satisfiable in a non-empty domain D* 
with the same or a lesser cardinal number. 

(b) If FJ? & Eq(Q, Pfei, .... Pfe S J where P*i, ..., V kSk are the predicate 
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letters of F k (k = 0, 1,2, .. .) are jointly satisfiable in a non-empty domain D, 
then F 0 , F lf F 2 , . . . are jointly satisfiable in a domain D* with 0 < D* < D. 

Proofs, (a) Suppose given a satisfying assignment z lt .... z , 
Q,P V .... P s for F« & Eq(Q, P 1( ..., P f ). Then Q,P V .... P s must 
satisfy the following formulas: (i) VaQ{a,a), (ii) Va\fb\Q(a, b) Z>Q(b,aY\, 

(iii) VaVAVc[Q(*. b) & Q(b, c) D Q (a, c)], (iv) V[Q(a, b) D 

(P;( a l> * * •> a i- 1’ a > a t+l> • * •> a ?y) * * •> a «—1> a /-f-l> • • •> a «/))] 

(j = 1, ..., s; i — 1, .. n,). We see this by paralleling set-theoretically 
the reasoning given proof-theoretically under (B); or by noting that the 
conjunction of these formulas is implied by (in fact, is equivalent to) 
Eq(Q, P lf ..P s ) in the pure predicate calculus, and using Theorem 21 
§ 37 and the valuation tables for & and 3 (or ~). (For proving Theorem 
34*, this step can be avoided by using this conjunction in place of 
Eq(Q,Pi,-P.)-) 

From the form of (i) — (iii) and the valuation procedures for V, D 
and &, we see that the logical function Q(a, b) which fulfils them must 
be reflexive, symmetric and transitive. (A relation Q{a, b) with these 
three properties we call an equivalence relation .) Hence the domain D 
falls into equivalence classes with respect to the relation Q, i.e. mutually 
exclusive non-empty classes such that any two members a and b of D 
belong to the same class, if and only if Q(a, b). Then since the formulas 

(iv) are satisfied, the value of Pj(a v ..a nj ) for any j (j = 1, ..., s) is 
unchanged by changing any one of its arguments to another in the same 
equivalence class. 

Now let us take the equivalence classes as a new domain D* (with 
0 < D* < D ); and define objects z*, ..., z* from D* and logical functions 
Q*,P*, ...,P* over D* thus: zf is the equivalence class to which z s 
belongs, i.e. z* = bQ(z j ,b); and Q*(a*,b*) {Pf{a*, . ..,«*,)) shall have 
the value which Q(a, b) (Pj(a v ..., a n f)) takes for any a which 8 a* and 
b 6 b* (for any a, 6 , a nj 8 a*.). Then z*, Q*, P \, ..., P* 

satisfy in D* any formula which z v .. .,z q , Q, P x . P s satisfy in Z); 

in particular they satisfy F e in D*. But Q*(a *, b*) = a* = b*. Hence 
z*, ..z*, P*, ■... P* satisfy F in D*. 

Theorem 40 oC (= Theorem 38*). Let one of the predicate letters 
P 0 , P 1( P 2 , ... occurring in F 0 , F 1; F 2 , ..., say P 0 , be a 2-place predicate 
letter. If the enumeration F 0 , F 1( F 2 , ... is effective (if infinite), then there 
are a domain D* and jointly satisfying predicates P*, Pf, P* , ... for 
Theorem 37* such that'. If D* is infinite, and for a suitable enumeration 
$!» ® 2 > • • • ^ P’oi^ai Sft) = ® = b, then Pj ( Sa 1 , ..., Sa„p = 
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(Ex){y)R*(a lt ..., a nj , x, y) = (x)(Ey)Sf(a v ..., a nj , x, y) where Rf and Sf 
are primitive recursive (/ = 1 , 2 , 3 , ...). 


Proof. By Theorem 38, jointly satisfying predicates Q, P 0 , P s for 
the formulas Fj? & Eq(Q, P fc i, ..P ks k ) (k = 0, 1,2, ...) can be chosen 
which are expressible in both the 2-quantifier forms of Theorem V § 57, 
and hence by Theorem XI § 58 are general recursive in 1-quantifier 
predicates. 

By the definition of the predicates P* in the proof of Lemma 24, 
(1) P*(s ai , . . ., Sa„.) = (Ec x ) . . . (Ec nj ) [C x C S«! & . . . & C nj £ Sa„. & 

Pj( c v • • • > C»y)] = ( c l) • • • ( C nj) C nj £ S a -+ P, (c v .... C n ,) j. 

Let z be some member of s 0 (which is not empty); then $ 0 = cQ(z, c). 
By the definition of P*, P 0 (d, c) holds if and only if P*(d*, c*) where d* 
and c* are the equivalence classes to which d and c, respectively, belong. 
But by hypothesis, P*(s a , s 6 ) =a'=b. Hence if dZs a , then c£s 0 . if 
and only if P 0 {d, c). Since no s 0 is empty, (a){Ed)[d 8 sj. Thus 


c£s 0 = <?(z, c), 

c£s a , = (Ed) [ dZs a 8c P 0 (d, c)] = (d)[d£s a -+ P 0 (d, c )]. 


Applying Theorem VI* § 57 or XI* § 58 to the second line, this shows 
that the predicate c £ s a is general recursive in Q, P 0 . In more detail: Con¬ 
sider the second line of ( 2 ) as of the form 


H(c) ^ (Ed)[K(d) & P 0 (d, c)] ^ (d)[K(d) P 0 (d, c)]. 


Let q, k, p, h, f be function letters expressing the representing functions 
of Q, K, P 0 , H, c £ s a , respectively. By Theorem VI* or XI*, H(c) is general 
recursive (uniformly) in K(d), P 0 (d, c), i.e. there is a system of equations 
(containing k, p, h) defining recursively (the representing function of) 
H(c) from (those of) K(d), P 0 {d, c). By Lemma VI § 65, we can introduce 
a parameter to obtain equations E defining recursively H(c, a) from 
K(d,a), P 0 (d,c). Then E with the three equations f(c, 0 )=q( 2 , c), 
k(d, a)=f(d, a), f(c, a')=h(c, a) defines c £ s a recursively from Q, Po. 
as we see with the help of induction on a. 

Since Q, P 0 are in turn general recursive in 1-quantifier predicates, 
so is c£s a ; and hence by Theorem XI, c £ s a is expressible in both 
the 2 -quantifier forms. 

Using the (Ex) (y )-expressions for c £ s a and P i (c 1 , ...,c n ,) in the 
middle expression of ( 1 ), advancing the quantifiers (by the informal 
analogs of *91 and *87 § 35), and contracting (by (17) §57), we obtain 
an (Ex)(y)-ex pression for P*(s 0l , • • •, s a ). An (x)(Ey )-expression is 




§ 73 THE PREDICATE CALCULUS WITH EQUALITY 403 

obtained similarly, using instead the (x)(Ey )-expression for P {c v ..., c, ly ) 
and the right member of (1) (and applying *91, *96, *95 *87, *98, *97, 
(17), (18)). 

We now consider also applied predicate calculi with equality, in which 
the terms and formulas are constructed using the logical symbolism of 
the predicate calculus with certain individual symbols e x , ..., e ? , function 
symbols f x , ..., f r and predicate symbols —, P x , .. ., P s (but no predicate 
letters). The number-theoretic definitions of ‘term’ and ‘formula’ (§ 17) 
provide an example (with q = 1, r — 3, s = 0). Although commonly each 
function or predicate symbol takes a number n > 1 of arguments, we 
may allow n > 0 , in which case individual symbols may be included 
among the function symbols, and proposition symbols among the pred¬ 
icate symbols. We shall write e.g. "f(s, t)” for the term constructed by 
placing terms s and t in the respective argument positions of a 2-place 
function symbol f, even though in a given system some other manner of 
combining the symbols may be used, e.g. “f” may stand for + and 
“f(s, t)” for s+t. 

The predicate calculus with equality is dealt with at length in Hilbert- 
Bernays 1934 pp. 164 ff. 

We shall see (Theorem 41 (b)) that in an applied predicate calculus 
with equality, Axiom Schema 23 is replaceable by a finite list of particular 
axioms without changing the provability and deducibility notions. 
The idea has already been used in setting up the number-theoretic system 
without Axiom Schema 23; in that case the particular axioms which 
replace Schema 23 did not appear as postulates except Axioms 16 and 
17, since the rest were deducible from the other number-theoretic axioms. 

(C) For the applied case, we read “predicate symbol P other than =” 
in place of “predicate letter” in (A). The open equality axioms for an n-place 
function symbol f shall be the n formulas 

a=b 3 f(a x , ..., s., ‘h+i> • • •> 3-n) f(^-i, • • •> A— 1 > b, a i+x , ..., a„) 

(i = 1, These are deducible in the predicate calculus from 

Axiom 22 and axioms by Axiom Schema 23; e.g. (with n — 2 ), 
a—b D f(a, c) = f(b, c) is deducible by *3 and D-elim. from 
a=b D (f(a, c) = f(a, c) D f(a, c) = f(b, c)), which is an axiom by Axiom 
Schema 23, and f(a, c)=f(a, c), which is deducible by substitution from 
Axiom 22 . Except in the case of =, the open equality axioms for a symbol 
are what we previously called the special replacement properties (cf. 
Theorem 23 § 38). We use “Eq( = , P x , . . ., P s , f 1( ..., f,.)” for the con¬ 
junction of the closed equality axioms for =, P 1( ..., P 5 , i v ..., f,.. 
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(D) Given the equality axioms for =, P 1( .... P s , f 1( ..f r , we can 
use the results of (B) and the former method of proof to establish Theorem 
24 and its corollaries for the case the part r which is replaced does not 
stand within the scope of any predicate symbol or letter, or function 
symbol, other than = , P 1( ..., P„ i lt ..., f r . 

(E) In any system in which Theorem 24 holds under the restriction 
just stated, every axiom by Axiom Schema 23 containing only the 
predicate symbols or letters =, P 1( ...,P S and the function symbols 
f 1( .,., f r is provable. For under the stipulations for the schema, by 
applications of Theorem 24 which replace by b each occurrence of a in 
A(a) which entered by substitution for x in A(x), we obtain a=b 
b A(a) ~ A(b) with no variables varied. Thence by *17a and D-introd., 
b a=b D (A(a) => A(b)). 

Theorem 41. (a) In the predicate calculus with equality, the reflexive 

(*100), symmetric (*101), transitive (*102) and replacement (Theorem 24 
and corollaries) properties of equality hold. 

(b) T b E in the applied predicate calculus with equality having 
= , P^ ..., P 4 , f x , ..., f r as its predicate and function symbols, if and only 
if T, Eq(=, P x , ..., P s , f 1( .... f r ) b E in the predicate calculus. 

(c) r b E in the pure predicate calculus with equality, when T, E are 
letter formulas in —, Px, ..., P s , if and only if T, Eq(=, P x , .... P s ) h E 
in the predicate calculus. 

Proofs, (a) By (A) — (D). 

(b) By (A) - (E). 

(c) Similarly, if we can first exclude the possibility that the deduction 
T b E requires axioms by Axiom Schema 23 containing other predicate 
letters P s+1 , ..., P s+< besides Px, .... P s . The deduction T b E is a 
deduction in the predicate calculus of E from T, a=a and axioms by 
Axiom Schema 23. By the method of Remark 1 § 34, we can thence 
obtain a deduction of E from T, a=a and axioms by Axiom Schema 
23 which contain only the predicate letters P x , ..., P s . 

Example 2 . As a fifth example for the Converse of Thesis II § 60, 
for a primitive recursive R{x,y) (cf. Example 1 §60), let <p v ■.y k 
be a primitive recursive description of the representing function 9 (= cp k ) 
of R. Let fj, ..., f fc be distinct function symbols (to express 9 X , ..., 9 *, 
respectively). Let the terms and formulas of S be constructed using the 
logical symbolism of the predicate calculus with the individual symbol 0 , 
the function symbols ', f x , .... i k and the predicate symbol =. Let S 
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have the postulates of the predicate calculus and as particular axioms 
the equations obtained by translating the schema applications for the 
description cp x> ..., <p fc (as Example 1 § 44 was translated to get Example 3 
§ 54, but with 1 = 0), together with the open equality axioms for =, ', 
f x , .. i k . Let A(x) be the formula 3yf fc (x, y) = 0. In 5 the substitution 
rule R1 of § 54 holds as a derived rule (by § 23), and likewise the re¬ 
placement rule R2 (by Theorem 41 (a) and (b)). Hence if, for some y, 
R(x, y) is true, i.e. <p k (x, y) = 0, then f fc (x, y) =0 is provable in S, and by 
3-introd., so is A(x). Conversely, A(x) is provable in 5 only if ( Ey)R(x, y), 
as will be shown in Theorem 52 § 79. 


*§ 74. Eliminability of descriptive definitions. At various stages 
in the informal development of a mathematical theory additions may 
be made to the stock of concepts and notations. If the development is 
formalized, at the corresponding stages new formation rules and postulates 
are added to a given formal system S x to obtain another S 2 . Thus the 
formulas (provable formulas) of become a subset of those of S 2 . The 
new formation rules introduce new formal symbols or notations, and the 
new postulates provide for their use deductively. We shall write “hi” 
(“h 2 ”) for the deducibility relation in S x (in S 2 ). 

Under such circumstances, we say that the new notations or symbols 
(with their postulates) are eliminable (from S 2 in Si), if there is an ef¬ 
fective process by which, given any formula E of S 2 , a formula E' of S x 
can be found, such that: 

(I) If E is a formula of S x , then E' is E. 

(II) h 2 E ~ E'. 

(III) If T ha E, then T hi E'. 

Here T' is D' v ..., D,\ if T is D x .D t . We call (I) — (III) the elimi¬ 

nation relations. (In Example 13, slight modifications are called for in 
our formulation of “eliminability”.) 

When the elimination relations hold, then furthermore: 

(IV) T h 2 E, if and only if T' hi E'. 

(V) If T, E are formulas of S v then T ha E if and only if T hi E. 

Proofs. (IV) For conversely to (III): If T' hi E', then a fortiori 
T' h 2 E'; thence by (II), T h 2 E. 

(V) By (I) with (III) or (IV). 
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Thus an eliminable extension of S x to obtain S 2 is inessential, in that 
by (V) it gives no enlargement of the class of the original formulas which 
are provable, while by (II) any new formula is equivalent in the enlarged 
system to one of the original formulas. 

Example 1. Eliminability of explicit definitions. In studying 
our formal system, we regarded ■—', 1, <, 3!x, x 2 , etc. as merely ab¬ 
breviations in the presentation of the metamathematics (cf. end § 17). 
Alternatively, we could have treated them as successive additions to the 
formal symbolism. In this case, each time we should have added a 
new formation rule (if A and B are formulas, so is A ~ B; 1 is a term] 
if s and t are terms, s<t is a formula ; if x is a variable and A(x) is a 
formula, 3!xA(x) is a formula ; if s is a term, s 2 is a term ; respectively), 
and a defining axiom or axiom schema 

({(A ~ B) D (A Z) B) & (B D A)} & {(A Z> B) & (B D A) Z> (A ~ B)}, 
1=0', a<b ~ ^c{c'+a—b), 3!xA(x) ~ 3x[A(x) & Vy(A(y) D x=y)], 
a 2 =a-a, respectively, with appropriate stipulations about A, B, x, A(x), y). 
These additions are eliminable, with ' the operation previously regarded 
as unabbreviation. Generally this is the case under the following con¬ 
ditions (briefly stated). The defining axiom or schema is an equivalence 
(an equation). Then shall be the predicate calculus (the predicate 
calculus with equality) and possibly additional particular axioms and 
axiom schemata. Theorem 14 § 33 (Theorem 24 (b) § 38) is used in proving 
(II). The two conventions for "permanent abbreviations’’ end § 33 
shall apply; the first of these is used in treating the case for Rules 9 
and 12 in proving (III), and the second for Axiom Schemata 10 and 11 , 
and an additional convention can be adopted to fix which bound varables 
are used in the E' for each E. For each additional axiom schema, 
whenever A is an axiom of S 2 by it, A' must be provable in Sj. 

Example 2. Let S 2 be our system of the classical predicate calculus 
with the addition of Axiom Schema 9a of Lemma 11 § 24, which by *95 
§ 35 is redundant in S 2 . (An axiom or axiom schema or several such are 
redundant in a formal system S, if the axioms involved are all provable 
in the system left after omitting those axiom(s) or axiom schema(ta) 
as postulates.) Let be what remains from S 2 when &, V, 3 and their 
postnlates are omitted. Then the elimination relations hold, when ' is 
the replacement of &, V and 3 by the equivalents given by *60 and *61 
§ 27 and *83 § 35. (In treating the case for Axiom Schema 6 in the proof 
of (III), assume A D C, B D C, ~iA Z) B and -iC, and deduce both B 
and -vB with the help of *12. Then use-i-introd. and elim., and D-introd.) 
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Example 3. In our number-theoretic system, b ~ A D 1 =0. 
This enables -i to be eliminated, if (redundant) postulates 8 ' (or 8 1 ') and 
15' are first added, where 8 ' is ((A D 1=0) D 1=0) 3 A, etc. (But lb 
includes 7'.) 

An elimination theorem can be looked upon as a new sort of derived 
rule (§ 20 ) for the study of the original system S v It enables us, when our 
purpose is to establish the provability of a formula E in S v to undertake 
instead to prove E in a suitable system S 2 having more machinery than 
S v Also by it any formula E not of S 1 which we prove in S a can be con¬ 
strued as an abbreviation for a provable formula E' of S v 

An elimination theorem has as corollaries that the simple consistency 
of S 2 is implied by that of S x (using (V), cf. § 28) and the decision problem 
for S 2 is reduced to that for Si (by (IV), cf. § 30, end § 61). 

Example 4. Non-eliminability of the recursive definition of *. 
The function symbol • with its recursion equations (Axioms 20 and 21) 
is not eliminable from the formalism of Chapter IV, call that S 2 . For 
by Presburger’s result (beginning § 42) the remaining system S x is 
simply consistent and its decision problem is solvable. If • were 
eliminable, the same would be true of S 2 , contradicting Theorem 33 § 61. 

Eliminability of descriptive definitions. In a descriptive definition, 
an object / is defined as the w such that F(w) (in symbols, iwF(w)), 
where F is a predicate for which it is known that there is a unique w 
such that F(w) (in symbols, ( E\w)F(w )). If w is the only independent 
variable for F, then iwF(w) is an individual /. If F depends on n other 
individual variables, write it “F{x v w)”, then t wF(x v ..., x n , w) 

is a function f(x v ..., x n ). We shall include the first case in the second by 
considering an individual as a function of 0 variables (accordingly, we 
take ‘function symbol’ to include ‘individual symbol’ in this section). 

The logic of descriptive definitions was treated by Whitehead and 
Russell 1910 (pp. 30—32, 66—71, 173—186 in the 2nd ed. 1925 ). 

Their eliminability was established by Hilbert and Bernays 1934 pp. 
422—457. Another proof was indicated by Rosser 1939 . These proofs 
establish the eliminability of iw as a formal operator, the admission of 
which into a given formalism introduces all the possibilities for de¬ 
scriptive definitions at once. Notationally it is simpler to use function 
symbols. A mathematician developing a theory will ordinarily introduce 
new function symbols successively as the need for them arises. We shall 
establish the eliminability of function symbols introduced in this manner. 
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It suffices to consider the introduction of one new function symbol at a 
time. 

Theorem 42. Let S x have the formation rules of an applied predicate 
calculus with equality (§ 73), and let the postulates of S x he those of the 
predicate calculus with additional particular axioms and axiom schemata 
such that the equality axioms for the function and predicate symbols of S x 
are provable (or let S x be simply an applied predicate calculus with equality 
and possibly additional particular axioms and axiom schemata). 

Let x v .. .,x n ,w (n> 0) be distinct variables, and F(x 1( .. x n , w) be 
a formula which contains free only x x , .. ,,x„,w, and in which x x , .. x n 
are free for w. Suppose that 

(i) 3!wF(x x , .... x B , w) 
is provable in S v 

Let S 2 be obtained from S x by adjoining a new n-place function symbol f 
and the new axiom 

(ii) F(x 1( .. .,x n , f(x x , .... x„)). 

Then the new function symbol f and its axiom are eliminable, i.e. (I) — 
(III) (and hence (IV) and (V)) hold for a certain effective correlation ' 
(which will be specified in the proof), provided each of the additional 
axiom schemata has the property that, if E is an axiom of S 2 by it, then E' 
is provable in S v 

Proof is provided by Lemmas 25 — 31. By Theorem 41 (b), it is im¬ 
material whether we consider the logic to be the predicate calculus or 
the predicate calculus with equality, when we know the equality axioms 
for the function and predicate symbols to be provable. Hence from the 
outset this is immaterial in the case of S v and it will be in the case of S 2 
as soon as we learn (in Lemma 27) that the equality axioms for the new 
function symbol f are provable in S 2 . 

Lemma 25. In the predicate calculus with equality, if u, v and x are 
distinct variables, F(v), C(v), C(u, v), A, B, A(v), B(v) and A(x, v) are 
formulas, u is free for v in F(v) and C(u, v), A and B do not contain v free, 
and F(v) does not contain u or x free: 

*181. 3!vF(v) b 3v[F(v) & C(v)] ~ Vv[F(v) 3 C(v)]. 

*182. 3!vF(v), C(v) b v 3 v[F(v) & C(v)]. 

*183. 3!vF(v) b 3u[F(u) & C(u,u)] ~ 3u[F(u) & 3 v[F(v) & C(u, v)]]. 

*184. 3!vF(v) b 3v[F(v) & (A 3 B(v))] ~ A 3 3 v[F(v) & B(v)]. 

*185. 3!vF(v) b 3v[F(v) & (A(v) 3 B)] ~ 3v[F(v) & A(v)] 3 B. 
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*186. b 3v[F(v) & A & B(v)] ~ A & 3v[F(v) & B(v)]. 

* 187. 3 !vF(v) b 3v[F(v) & (A V B(v))] ~AV 3v[F(v) & B(v)]. 

*188. 3!vF(v) b 3v[F(v) & iA(v)] ~ i3v[F(v) & A(v)]. 

*189. 3!vF(v) b 3v[F(v) & VxA(x, v)] ~ Vx3v[F(v) & A(x, v)]. 

*190. b 3v[F(v) & 3 xA(x, v)] ~ 3x3v[F(v) & A(x, v)]. 

(Cf. David Nelson 1947 Lemma 23 pp. 347—348.) 

Proofs. *181. Assume 3 !vF(v), F(v)&C(v) (for 3 -elim.) and F(t) 
where t is a new variable (for D-introd.). Then using *172 § 41, v=t; 
and hence by replacement, C(t). By D- and V-introd. and change of 
bound variables, Vv[F(v) D C(v)]. The variable of the V-introd. was t, 
so no variable has been varied. By 3-elim. and D-introd., 

3v[F(v) & C(v)] D Vv[F(v) DC(v)]. For the converse, assume 3!vF(v) 
and Vv[F(v) D C(v)]. From 3!vF(v), we get 3vF(v). Preparatory to 
3-elim., assume F(v). 

*183. 3 u[F(u) & C(u,u)] b 3u[F(u) & F(u) & C(u, u)] (*37 §27) 

b 3u3v[F(u) & F(v) & C(u, v)] (*80 §35) b 3u[F(u) & 3y[F(v) & C(u, v)]] 
(*91). The converse can be based similarly on *79, with help from *181, 
*4 § 26 (with *69 § 32), etc. 

*184. 3!vF(v) b 3v[F(v) & (A D B(v))] ~ Vv[F(v) D (A D B(v))] 

(*181) — Vv[A D (F(v) D B(v))] (since using *3, b B D (A D C) ~ A D 
(B D C)) ~ A D Vv[F(v) D B(v)] (*95) ~ A D 3v[F(v) & B(v)] (*181). 

*187. 3!vF(v) b 3vF(v) b 3v[F(v) & (A V B(v))] ~ 3v[(F(v) & A) 
V (F(v) & B(v))j (*35) — (A & 3vF(v)) V3v[F(v) & B(v)] (* 88 , *91 with 
*33) ~ A V 3v[F(v) & B(v)] (*45). 

*188. Use successively *181, *58b, * 86 . 

Lemma 26. In the predicate calculus with the equality axioms for the 
function and predicate symbols of Si only but with f admitted to the symbolism 
(a fortiori, in the predicate calculus with equality), the conjunction of (i) and 

(ii) is interdeducible with 

(iii) f(xj, ..., x„)=w ~ F(x x , ..., x n , w). 

Hence (iii) is provable in S 2 . 

Proof. From (ii) and f(x x , ...,x„)=w, we deduce F(x,... .,x n , w) 
by the replacement property of equality, which by (D) § 73 requires 
only the predicate calculus with the equality axioms for = and the 
function and predicate symbols of F(x x , ...,x„, w). From (i), (ii) and 
F(xj, ...,x„, w), we deduce f(x x , ...,x n )=w by *172. From (iii), we 
deduce (i) with the help of *171, and (ii) by substituting f(x x , ..., x n ) 
for w in (iii) and using Axiom 22 § 73. 
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For the rest of the proof of Theorem 42, we shall use "F(x x , ..., x„, w)” 
under the convention which applies to permanent abbreviations (cf. 
end § 33) that, for any terms t x , . .., t„, s, “F(t x , ..., t„, s)” shall denote 
the result of substituting t 1; ..., t n , s for x 1( ..., x n , w in F(x x , ..., x n , w) 
after any legitimate change of bound variables in F(x x , . ..,x„, w) 
which makes the substitution free. 

Lemma 27. In the system of Lemma 26, the equality axioms for f are 
deducible from (iii). Hence they are provable in S 2 . 

Were this not so, adding only (ii) to the axioms of S x would not give 
effective use of f in S 2 (cf. Theorem 41 (a) and (b)). 

Proof. For example, if n = 2 (and writing a, c for x x , x 2 ), we can 
prove a=b D f(a, c) = f(b, c) thus. Assume a=b. By replacement, 
F(a, c, f(b, c)) ~ F(b, c, f(b, c)). Thence using (iii), f(a, c)=f(b, c) ~ 
f(b, c)=f(b, c); whence (Axiom 22 and * 18b), f(a, c)=f(b, c). 

Lemma 28. Let t x , . ..,t„ be terms, v be a variable not occurring in 
t x , ..., t„, and C(v) be a formula in which f(t x , ..., t„) is free for v. Then 
C(f(t x , ..t n )) ~ 3v[F(t x , ..., t„, v) & C(v)] is deducible from (iii) in 
the predicate calculus with equality, and hence is provable in S 2 . 

Proof. Assume C(f(t x , ..., t„)). Substituting in (ii) (cf. Lemma 26), 
F(t 1( . ..,t„, f(tj, ...,t„)). Now use &-, 3- and D-introd. Conversely, 
assume (for 3-elim.) F(t x , ..., t„, v) & C(v). Using (iii), f(t x , ..., t„)=v. 
By replacement in C(v), C(f(t x , ..., t„)). 

A prime formula is one containing no logical symbol, i.e. here it is a 
predicate symbol with terms as arguments. A term or formula of S x , 
i.e. one not containing f, we call i-less. A term of the form f(t x , ..., t„) 
where t x , ..., t n are terms we call an i-term ; and if t x , ..., t n are f-less, 
we say it is a plain f-term. An occurrence of a term in a formula is bound 
{free), if it is (is not) within the scope of some (any) quantifier Vy or 3y 
where y is a variable of the term. 

Example 5. Let P and Q be predicate symbols, g be a function symbol 
distinct from f, and x and y be distinct variables. In the following formula, 
the second occurrence of an f-term is bound; the c ther six are free. 

1 . Vx{P(f(y), f(g(x))) & Q(x, x)} 

D P(f(y), f(g(f(y)))) & Q(f(y), f(y)). 

The f-terms f(y) and f(g(x)) are plain, but f(g(f(y))) is not, as the f-term 
f(y) is nested within it. 
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Lemma 29. There can be correlated effectively to each formula E of S 2 
a formula E' of S x (called the principal i-less transform of E) in such a 
way that (I) and (II) hold, no free variables are introduced or removed, 
and the operators of the predicate calculus are preserved, i.e. (A DB)' 
is A'DB', (A & B)' is A'&B', (AVB)' is A'VB', (-<A)' is -iA', 
(VxA(x))' is VxA'(x) (where A'(x) is (A(x))') and (3xA(x))' is 3xA'(x). 

Proof. The condition that the operators are preserved defines E' 
by recursion on the number g of occurrences of logical symbols in E, pro¬ 
vided we supply as basis a definition of E' for the case E is prime. We do 
this by induction on the number q of occurrences of f-terms in E, thus. 

If g = q =s 0, E' shall be E. 

If g = 0 and q > 0 , select the first occurrence of a plain f-term 
in E, say it is f(t 2 , ..t„). Let v be a variable which does not occur 
in E. Let C(v) result from E by changing the occurrence of f(t 1 , ..., t n ) 
under consideration to v. Then E' shall be 3v[F(t 1 , ..., t„, v) & C'(v)]. 
(By *181, we could equally well use Vv[F(t 1 , ..., t n , v) D C'(v)].) There 
is an ambiguity here regarding the choice of the bound variable v and 
the manner in which the bound variables of F(x 1( ..., x„, w) are changed 
when necessary to make the substitution of t 1( ..., t„, v for x 1( ..., x B , w 
free. But different legitimate choices lead to congruent formulas (§ 33). 
We may suppose some convention supplied to fix which one is E' itself. 
In our illustrations, it will suffice to use any congruent of E\ Note that 
C(v) is prime and contains just q— 1 occurrences of f-terms. 

The properties of ' mentioned in the lemma follow by a corre¬ 
sponding induction on g (using Theorem 14 §33 for (II)), with in¬ 
duction on q within the basis (using Lemma 28). 

A suggestive abbreviation is to write “F^-- t »C(v) M for 
3v[F(ti, ...,t„, v) &C(v)]. 

Example 5 (continued). Formula 1 is reduced to a congruent of 
its principal f-less tiansform thus. 

2. VxjF^PIv,, f(g(x))j & Q(x, x)> 

3 F;,P(v„ f(g(f(y)))) & f(y)). 

3. Vx{F;,F«“P(v a ,v 3 )&Q(x,x)} 

3 F ?, F ;'„ p ( v !. f(g(v,o)) & fj^fx.qiv,,, v a ). 

4 • Vx{F;,Fy j -'P(v ! ,v,)&Q(x,x)} 

3 F ?, F J„ FV, Tu, P(v I . v 3 ) & Fx.F^.Qiv,,, v„). 
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Let us call the prefixes . tn "F-quantifiers”. By *184—*190 

(since (i) gives us the assumption formulas 3!vF(v)), F-quantifiers can be 
permuted in S 1 with the operators of the predicate calculus and with 
each other (changing bound variables as necessary), subject to the 
restriction that F* 1 ’"*'*" may not be advanced (leftward) over an or¬ 
dinary or F-quantifier which binds any variable of t x , ..., t n (since for 
*189 and *190 the F(v) must not contain the x free). This restriction as 
between two F-quantifiers means simply that two F-quantifiers which 
result from eliminating f-terms one nested within the other must be kept 
always in the same relative order (the one corresponding to the inner 
f-term being to the left). As between an F- and an ordinary quantifier, 
it means that an F-quantifier resulting from the elimination of a bound 
occurrence of an f-term may be advanced leftward only up to the right¬ 
most of the ordinary quantifiers which bound the occurrence. 

Moreover by *183, adjacent like F-quantifiers F* 1 .and F£.can 

be contracted. 

Also *182 constitutes an introduction rule for F-quantifiers. 

Example 5 (concluded). Formula 1 is an axiom VxA(x) D A(t) 
of S 2 by Axiom Schema 10, with f(y) as the t; and we chose the same 
bound variables in eliminating corresponding f-terms of A(x) and A(t) 
in the reduction to Formula 4. Now by *186, *190, *186 (with *33), 
*184 and *183 (with a change of bound variables), we can advance the 
three F-quantifiers F* , F* , F^ w resulting from the elimination of the 
three occurrences of f(y) as the t of the schema application to the front, 
and contract them to one F-quantifier F^. This gives us the following 
formula, as an equivalent in Si of Formula 4 and hence of the principal 
f-less transform of Formula 1. 

5. FyVx'F^F™ P(v„ v„) & Q(x, x)> 

3 F;,F?" ,l P(v„v 1 )&Q(v 1 ,v I )]. 

Now the scope of F* in 5 is an axiom of S x by Axiom Schema 10, with 
v 1 as the t. So by F-introd. (*182), Formula 5 is provable in S v and hence 
also the principal f-less transform of Formula 1. 

Lemma 30. If E is an axiom of S 2 , then hi E'. 

Proof. If E is an axiom of S 2 by any axiom schema of the prop¬ 
ositional calculus, then E' is an axiom of Si by the same schema, since 
' preserves the operators of the calculus (Lemma 29). 

Axiom Schema 10 : E is VxA(x) D A(t), where t is free for x in A(x). 
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Case 1 : t is f-less. Then by choosing the same bound variables for 
corresponding steps in the reductions of A(x) and A(t), we can obtain 
a congruent of E' which is of the form VxB(x) D B(t) where t is free for 
x in B(x). This is an axiom of S x by Axiom Schema 10 , with the same t. 
Hence (- x E'. 

Example 6 . Let E be 

Vx{P(f(y), f(g(x))) & Q(x, x)} D P(f(y). f(g(t))) & Q(t, t), 
where t is f-less. Then 

Vx{FJF*fP(v lt v,) & Q(x, x)} 3 F^Plv,, v,) & Q(t, t) 
is congruent to E', and is an axiom of S x by Axiom Schema 10 . 

Case 2: the general case. Say that A(x) contains k occurrences of 
f-terms and l free occurrences of x, and that t contains m occurrences of 
f-terms. (Illustrated with k = 2,1 = 2, m = \ by Example 5.) Then A(t) 
contains k occurrences of f-terms which originate from A(x) and thus 
correspond to the f-terms of A(x), and Im others consisting of m in each 
of the l occurrences of t which take the place of the free occurrences of 
x in A(x). In the reduction of E to a congruent of E', we can use F- 
quantifiers with the same bound variable in eliminating each of the k 
pairs of corresponding f-term occurrences from A(x) and A(t), and F- 
quantifiers with other distinct bound variables in eliminating from A(t) 
the Im f-term occurrences in the t’s. Say that in the elimination, these l 
occurrences of t become t*, ..., t* (v n , v 12 , v 13 in the example). Since 
none of the k f-term occurrences in A(t) which originate from A(x) can be 
nested within any of the Im which enter by the substitution of t for x, 
and since the substitution is free, the Im F-quantifiers used in eliminating 
the latter can be advanced to the front, and in such an order that each 
group of l of them belonging to corresponding f-term occurrences in the 
l occurrences of t are adjacent, after which each group can be contracted. 
We thus obtain as an equivalent in S x of E' a formula of the form 
(A) F®“ . . ... F ®* 1 . Smn [VxB(x) D B(t*)], 

wheret*results from each of tf, ..., t* by the identification of variables 
in the contraction, and is free for x in B(x). Now VxB(x) D B(t*) is an 
axiom of by Axiom Schema 10 , and (A) is provable thence by m 
applications of F-introd. (*182). 

Axiom (ii): E is F(x 1 , ..., x n , f(x 1( ..., x„)). Suppose w occurs free l 
times in F(x,, ..., x n , w). Then E' contains / F-quantifiers. These can be 
advanced and contracted to give f* 1 '- ,x »F(x 1 , ...,x n , v), i.e. 
3v{F(xj, ..., x„, v) & F(xj, ..., x B , v)}, which is provable from (i). 
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Lemma 31. If E is an immediate consequence of F [of F and G) in S 2 , 
then E' is an immediate consequence of F' [of F' and G') in S x . 

For each of Rules 2, 9 and 12, because ' preserves the operators, and 
does not introduce free variables (so that the C of Rule 9 or 12 is not 
transformed into a formula C' containing x free). 

Remark 1 . If f is not 0 or the induction schema satisfies the 
proviso of the theorem for additional axiom schemata, as is seen by the 
reasoning used for Axiom Schema 10 Case 1. 

Example 7. Let S x be the number-theoretic system of Chapter IV. 
Let S 2 result from S x by adjoining the function symbol rm (to express the 
remainder function) with the axiom 

3q(a=bq+rm(a, b) & rm (a, b)<b) V (b=0 & vm[a, b)=a) 

(cf. *179b, c §41). Then by the theorem (together with Remark 1), rm 
and its axiom are eliminable. — Let the notation [alb] for the quotient 
and the axiom Br[a=b[a/b]+r & r<b) V b=[a/b] = 0 (cf. *178b, c) also 
be adjoined. By successive applications of the theorem, both rm and 
[a/h] can be eliminated. 

Example 8 . Let S x be the number-theoretic formal system, R(x, y) 
be a formula containing free only x and y, and R[x, y) be the predicate 
which R(x, y) expresses under the interpretation, (a) Consider the 
classical system S v and suppose R is such that: (A) f-i 3yR(x, y). 
Then using *149 and *174b, a function symbol f introduced with the 
axiom R(x, f(x)) & Vz(z<f(x) D -iR(x, z)), so that under the (classical) 
interpretation (at least) f(x) expresses the function \xyR[x, y) (beginning 
§ 57, assuming about 5 X that (A) implies (lb) § 57 (with n — 1)), will be 
eliminable, i.e. the elimination relations hold from the system S 2 thus 
obtained. Using *149a instead of *149, the same holds in the intuitionistic 
S v when besides (A) also: (a) Hi R(x, y) V-iR(x, y). (b) Now consider 
the classical S v without supposing (A). Let Rf(x, w) be R(x, w) V 
{-i3yR(x, y) & w=0}, and Rt(x, w) be R f (x, w) & Vz(z<w D -iR + (x, z)). 
By *51, hi 3yR(x, y) V-i3yR(x, y). Thence using cases (V-elim.), 
Hi 3wRt(x, w); and by *149 and *174b: (1) Hi 3!wRt(x, w). So a 

function symbol f introduced with the axiom Rt(x, f(x)) will always 
be eliminable classically. Under the (classical) interpretation, now f(x) 
expresses &yR(x, y) (beginning § 62; cf. (59) §63). The eliminability holds 
in the intuitionistic S x , when R is such that (a) holds and also: 

(P) H 3yR(x, y) V -i3yR(x, y). For (p) enables us to dispense with *51, 
and (a) and (p) with *158 and Remark 1 (b) §29 give Hi R t (x, w) V 
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-iRt(x, w), which enables us to use *149a instead of *149. (c) Let <p(x) be 
any function of which the representing predicate cp(x)—w is arithmetical 
(§ 48). Let R(x, w) be a formula expressing 9 (x)=w. Then the f(x) of (b) 
expresses zw[y{x)=w], i.e. <p(x). Thus in the classical number-theoretic 
system, for any function 9 (x) such that <p(x)=w is arithmetical, a formula 
F(x, w) containing free only x and w can be found such that F(x, to) is true 
under the interpretation exactly when w — <p(x), and a new function sym¬ 
bol f expressing 9 with the axiom F(x, f(x)) is eliminable. In particular, 
such a formula F(x, w) can be found for any general recursive function <y(x) 
(by Theorem VII (b) § 57); and for many functions {under classical inter¬ 
pretation) which are not general recursive, e.g. zyT x {x, x, y) (cf. (b) or be¬ 
ginning § 62, and Example 1 § 63). (d) Conversely, if an axiom of the form 
F(x, f(x)) characterizes f(x) as expressing a function <p(x) (i.e. if for each 
x, F(x, to) is true under the interpretation exactly when w— <p{x)), 
then (f{x) = w is arithmetical. — The situation for the intuitionistic 
system will be considered further in § 82 (Examples 1 and 2). 

For the next two examples, we shall suppose that ( 1 ) — (3) of Remark 
1 § 49 have been established, either by formalizing § 48 directly, or by 
borrowing from Hilbert-Bernays 1934 pp. 401—419. The Hilbert- 
Bernays treatment is in a formal system which (disregarding obviously 
inessential differences, including their use of predicate variables, cf. end 
§ 37) results from our classical number-theoretic system by adding an 
operator (with appropriate postulates) which, applied to a formula 
R(x, y), gives a term expressing the function syR{x, y) where R{x, y) is 
the predicate expressed by R(x, y). By Example 8 (b), each use of this 
operator can be eliminated. Applying the process to the formulas 
displayed fifth on p. 416 and seventh on p. 419, taking the a and 
b(n, p (m, n'- /+1)) to be simply a variable w, we are led to (a) and ((3) of 
Remark 1 § 49. This treatment can be adapted to the intuitionistic system. 
(Hilbert and Bernays write the operator pi x ; their z x has another meaning, 
cf. 1939 pp. 9 ff., Hilbert 1928 .) 

Example 9. Eliminability of further primitive recursive 
definitions, having those for + and • (cf. Example 4). Suppose 
9 1( .. .,9 fc is the primitive recursive description of a function 9 {— 9*.). 
Let S x be the number-theoretic system with additional function symbols 
f lt ...,i k _ x (expressing ...,(p k _ lt respectively), and with equations 
obtained by translating the schema applications for 9^ ..., 9*_ 1 (as 
in § 54) adjoined as axioms. Say e.g. that 9 comes from tji, x (where x 
are from the list <p v ..., 9*_ 1 ) by Schema (Vb) § 43 with n = 2. Applying 
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Remark 1 §49: (1) hi P(0, x, w) ~ Q(x, w), (2) hi P(y', x, w) ~ 

3z[P(y, x, z) & R(y,z,x, w)], (3) hi 3!wP(y, x, w). We also assume (as hypo¬ 
thesis of an induction on k) : (4) hi g(x)=w ~Q(x, w), (5) hih(y,z,x)=w 
~R(y, z, x, w) (where g, h express <J>, y, respectively). The theorem 
(with (3)) tells us that the addition of f to with P(y, x, f(y, x)) as the 
additional axiom is eliminable. But in S x with f added to the symbolism, 
P(y, x, f(y, x)) is interdeducible with f(0, x) =g(x) & f(y', x)—h(y, f(y, x), x), 
as we easily see (using (1) — (5), and in one direction, induction on y). 
Hence the pair of equations f(0, x)=g(x), f(y', x)=h(y, f(y, x), x) may be 
used instead of the formula P(y, x, f(y, x)) as the additional axioms, with 
the same results. (This pair of equations is obtained slightly differently 
in Hilbert-Bernays 1934 on p. 421.) 

Example 10. Let Sj be the number-theoretic formalism. By the 
proof of Theorem 32 (a) § 59, there is a formula P(z, x, w) such that, 
if e is a Godel number of a general recursive function cp(x), then P(e, x, w) 
numeralwise represents <p(x). Then 3!wP(e, x, w) is provable for each 
natural number x. Must 3!wP(c, x, w) be provable? (It certainly is for 
some choices of 9 and e, e.g. by formalizing via § 56 reasoning given in 
§54.) By Theorem 31 §52, {(-j 3!wP(z, x, w)} (Ey)R(z, y) for some 
primitive recursive R. Let 0 be obtained from this R by Theorem XIV 
(b) § 60. Then 0 is primitive recursive, and 0(y) for y = 0, 1,2, ... is an 
enumeration of the numbers z for which 3!wP(z, x, w) is provable. Under 
the interpretation, 3!wP(z, x, w) is true only when 2 is the Godel number 
of a general recursive function of one variable. Suppose that S x has the con¬ 
sistency property that 3!wP(z, x, w) is provable only then. Now for each y, 
0(y) is the Godel number of a general recursive function, which can be 
written O x (0(y), x) (beginning §65); and so (^(©(z), *) + l is a general 
recursive function y(x), for any Godel number e of which 3!wP(e, x, w) 
is unprovable in S v — For such an e, let S 2 be obtained from S 2 by 
adjoining a new function symbol f with the axiom P(e, x, f(x)). Then f 
(and its axiom) are not eliminable. For, using *174a and the result (3) of 
Remark 1 § 49 for the primitive recursive function U (cf. the proof of 
Theorem 32), we easily show that P(e, x, t) hi 3!wP(c, x, w). Hence, 
using the new axiom, h 2 3!wP(e, x, w). If f were eliminable, then 
3!wP(«, x, w) would be provable in S v which is not the case. Thus 
in the number-theoretic system, there is a formula P(x, w) containing free 
only x and w such that, under a consistency assumption : P(x, w) numeral- 
wise represents a general recursive function 9, but a new function symbol 
f {expressing 9) with the axiom P(x, f(x)) is not eliminable. 
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For a given F, f and formation rules (here those of S 2 ), we shall mean 
by a transform of a formula E any formula D such that E ~ D is 
deduciblein the predicate calculus with equality from the formula (iii). 

Remark 2 . (a) The transforms of E include its principal f-less 

transform E' (by Lemma 28 and the proof of Lemma 29) and all formulas 
obtainable thence by manipulations of F-quantifiers based on Lemma 25 
(by Lemma 26). (b) Any two f-less transforms E 1 and E 2 of a formula 

E of S 2 , since equivalent in S 2 (by Lemmas 26 and 27), are equivalent in 
(by (V)). This holds for any 5 X for Theorem 42 with the given formation 
rules (without the proviso for additional axiom schemata, since the proof 
of E 1 ~ E 2 in S 2 uses axioms by these only of Si), and in particular when 
Sj is simply the predicate calculus with equality and (i) as an axiom. 

Replaceability of undefined functions by predicates. Theorem 
43. (a) Let S 2 be an applied predicate calculus with equality which has a 

symbol f for a function of n variables (n ;> 0); and let S x come from S 2 
by omitting f and supplying instead a symbol F for a predicate of «+1 
variables with the axiom 3 !wF(x 1 , ...,x n , w) ( or the postulates of S 2 
may be those of the predicate calculus with the equality axioms for the function 
and predicate symbols of S 2 , in which case in forming S x we also omit the n 
equality axioms for f and supply instead the n+ 1 equality axioms for F). 

For any formula E of S 2 , let E' be any particular i-less transform of E 
{with the present f and F, under formation rules allowing both in the sym¬ 
bolism). For any formula E of S 1 , let E° be the formula of S 2 obtained from E 
by replacing simultaneously each part of the form F(t x , ...,t n , s) where 
t 1( ..., t n , s are terms by f(t x , ..., t B ) = s. Then: 

(Via) h E ~ E 0 '. (VIb) h E ~ E'°. 

(Vila) {r b 2 E}->{r h E'}. (viib) {r k E}-*{r° h 2 e°}. 

(b) Likewise when S 2 has additional particular axioms B 1( ..., B fc 
and axiom schemata Si, . ..,S 8 j, and S x has as additional particular 
axioms Bj, ..., B(. and additional axiom schemata 3l 1( ..., 9I t such that, 
if E is an axiom of S 2 by 93 < {of S x by 9I f ), then hi E' (|- 2 E°). (Cf. Hil- 
bert-Bernays 1934 pp. 460 ff.) 

From (Via) — (Viib) it follows that: 

(Villa) {r h 2 E} = {r' hi E'}. (vnib) (r hi E}^{r o h 2 e°}. 

Proofs. (Villa). For conversely to (Vila): If T' hi E', then by 
(Viib), r h 2 E'°; and thence by (VIb), T h 2 E. 

Proof of Theorem 43. (a) We begin with the version in which 
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the logic is the predicate calculus with equality. Let S 3a come from S 2 
by adding F to the symbolism with f(x 1 , . . ,,x„)=w~ F(x x , ..., x n , w) as 
axiom. Regarding this axiom as an explicit definition of F, the additions 
are eliminable with ° as the correlation, by Example 1. But by Lemma 26, 
the axiom f(x 1 , ..., x„)=w ~ F(x x , ...,x„,w) can be replaced in S 3a 
by the pair of axioms 3!wF(xj, .. ., x„, w) and F(x 1 , ..., x„, f(x 1 , . . ., x n )) 
without changing the deducibility relationship. So the elimination 
relations hold also from the resulting system S 3 , i.e.: 

(lb) If E is a formula of S 2 , E° is E. 

(lib) b 3 E~E°. (Illb) {r h 3 E} -> {T 0 b 2 E°}. 

But Theorem 42 applies to S 3 (as its S 2 ), and the result of the elimi¬ 
nation of f from S 3 is S v Thus, using Remark 2 (b) if ' indicates some 
other than the principal f-less transform: 

(la) If E is a formula of 5 X , hj E' ~ E. 

(Ha) b 3 E~E'. (Ilia) {T |- 3 E} -*► {T' h E'}. 

Now (Via) —(Vllb) follow, e.g.: 

(Via) By (lib) and (Ila), h 3 E ~ E° — E°\ But E ~ E 0 ' is a 
formula of S v Hence by (Va), l-j E ~ E 0 '. 

(Vila) {r h 2 E}-> {T h E} (a fortiori) -> {V b E'} (by (Ilia)). 
The version with equality axioms follows by Theorem 41 (b), or can 
be treated directly thus. The' equality axioms for F are provable in S 3a . 
To pass to S 3 , we first add them as axioms, then replace (iii) by (i) 
and (ii), and finally omit the equality axioms for f (using Lemma 27). 

Remark 3. Any two f-less transforms E 1 and E 2 of E are equivalent 
in S v by the proof. Since this equivalence is already established in 
the system 5 X of (a) of the theorem, it suffices to satisfy the conditions 
of (b) for any one convenient way of choosing the transforms. 

Remark 4. For the version with equality axioms, and for F a pred¬ 
icate symbol: The entire discussion beginning with Theorem 42 and 
including the definition of ‘transform’ holds good, when, for certain 
values of i, we exclude the formation of terms f(t 1( ..., t„) and formulas 
F(t 1( ..., t n , s) with t i for any of these values of i containing f, and 
omit the equality axioms for f and F for these values of i. 

Remark 3. With equality axioms, in Theorem 43 (b) the additional 
postulates may make some of the n equality axioms for f redundant in 
S 2 . Are the corresponding equality axioms for F redundant in S x ? Sup¬ 
pose more particularly that for certain values of i there exist proofs of 
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the i- th equality axioms for f from the remaining postulates of S 2 in 
which proofs no f-terms f(t lt ..., t„) occur with t* for any of these values 
of i containing f. Then the equality axioms for F for these values of i 
are redundant in S v provided that (b) is satisfied when ' means prin¬ 
cipal f-less transform, or any other f-less transform in the altered 
sense of Remark 4. For then by Remark 4, using (VII) the principal 
f-less transforms of the equality axioms in question for f are provable 
in the altered S lf and thence the corresponding equality axioms for F, 

Example 11 . (a) Let S 2 be the full number-theoretic system or 

Robinson’s (Lemma 18b § 49). By (b) of the theorem, we can replace the 
function symbol • by a predicate symbol, in the following manner. 
Say the new predicate symbol is •, written preceding its three arguments. 
We adjoin as new axioms 

a=b 3 (•( c,d,a ) 3 • (c, d, b)), 3 \c-{a, b, c); 

and if we use principal --less transforms, we change Axioms 20 and 21 to 
3b[-(a,0,b) 8cb=0], 3c[• (a, b', c) &3</[- (a, b, d) & c=d+a]], 
respectively, but these may be simplified to 

•{a, 0, 0), 3 d[-(a, b', d+a) &-(a, b, d)]. 

The other two equality axioms 

a=b 3 {■{a,c,d)^-(b,c,d)), a=b 3 (•(, c,a,d) 0-{c,b,d)) 
for • as a predicate symbol need not be adjoined, if S 2 is the full number- 
theoretic system, since they are in fact provable in the system described 
(by Remark 5 with the proofs of the equality axioms for • as a function 
symbol § 38); but if S 2 is Robinson’s system, we adjoin them in place of 
the formulas of *106 and *107 as axioms. By a second application of the 
theorem we can further replace + . In doing so, if Axiom 18 is changed to 
its principal -{--less transform or to +(<z, 0 , a), a=a will still be provable 
(and will remain so if 0 is replaced similarly under (b) below). By a third 
application we can further replace '. In doing so, we change the induction 
schema say to A( 0 ) & Vx(A(x) 3 3 y['(x, y) & A(y)]) 3 A(x). Thus we 
obtain a system S x without function symbols in the ordinary sense, i.e. 
for n > 0 , related to S 2 by (Via) — (VUIb), where now ' and ° denote 
successive eliminations of several symbols, (b) If we wish a system lacking 
also individual symbols, we can still further replace 0 by an application 
with n — 0 , or we can eliminate 0 before replacing ' (next example). 

Example 12 . Using *137 and Axiom 15, 0—b ~ 'ia{a'^b) (which is 
of the form (iii)) is provable. This fact can be utilized to eliminate 0 by 
Theorem 42 after some preliminary transformations. 
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Example 13. Eliminability of a defined sort of variables. 
Let S x be the predicate calculus (say under formation rules using indi¬ 
vidual, function and predicate symbols) with additional particular axioms 
and axiom schemata. Suppose that for a certain formula M(w) containing 
free only w, the formula 3wM(w) is provable in S v Let S 2 come from S x 
by adjoining a new sort of variables a,b,c, . .., admitting these in 
constructing terms and formulas and for Rules 9 and 12, restricting the 
x for Axiom Schemata 10 and 11 to be a variable of the original sort, and 
adding the following three axiom schemata, where x is a variable of 
the new sort, A(x) is a formula, and t is a term free for x in A(x) and 
for w in M(w). 

0. M(x). To. M(t) D (VxA(x) D A(t)). IT. M(t) D (A(t) Z> 3xA(x)), 

Given any formula E of S 2 , let Et result from E by replacing each part 
of the form VxA(x) by one of the form Vx[M(x) D A(x)], and of 3xA(x) 
by 3x[M(x) & A(x)], where x is a variable of the original sort not oc¬ 
curring in A(x) or M(w); then let Et come from Et by prefixing 
M(y x ) & ... & M(y m ) D where y v ...,y m are exactly the distinct 
variables of the new sort which occur free in E (and hence in Et); and let 
E' come from Et by substituting for y v ..., y m distinct variables 
y x , ..y m °f the original sort not occurring in Et (and hence not in E). 
Then (I) — (III) hold with the following modifications, provided that 
for each additional axiom schema of S x , to each axiom A of S 2 by it, 
A' is provable in S,. For nt > 0 , (II) becomes: E yi '" y ”H H yi '" ym E'. 
In (III), if corresponding variables are to be held constant for cor¬ 
responding assumption formulas, for each y the same y should be 
substituted in E and each of T for which y is held constant (so in this 
case the operation ' is not specified for each single formula by itself). 
Before treating (II), Theorems 1, 2 and 14 can be extended appropriately 
to S 2 . We can use Lemma 8 a § 24 to reduce (III) to the case of it with 
T empty. To treat this, write Et as ‘‘Et(y 1 , ..., y m )”. We can show by 
induction that, if H 2 E, then M(y x ), ..., M(y m ) Hi Et(y x , ..., y m ) holding 
y x , ...,y m constant. (For Rule 2, say e.g. At contains only one new 
variable y, write it “At(y)”, and Bt none. By hyp. ind., M(y) Hi At(y) 
and M(y) Hi At(y) D Bt. By Rule 2, M(y) Hi Bt. By 3-elim. and *74, 
3wM(w) Hi Bt. But Hi 3wM(w). Hence Hi Bt.) — Similarly intro¬ 
ducing sever'al sorts of variables successively. 

§ 75. Axiom systems, Skolem’s paradox, the natural number 
sequence. Suppose we are dealing with an axiom system (§ 8 ) having 
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as its primitive or undefined notions a set or domain D of individuals, 
certain individuals z lt ..., z q from D, and certain predicates P v ..P s 
over D. An axiom of the system which can be expressed by a formula in 
the symbolism of the predicate calculus (i.e. the restricted or first order 
predicate calculus, cf. § 37) with individual symbols e x , ..., e 9 to express 
z lt ..., z q and predicate symbols Pr x , ..., Pr 3 to express P lt ..,, P s , 
respectively, we call elementary. If each axiom is elementary, and the 
axioms are finite in number, we call the axiom system elementary or 
of first order. 

Now we can always choose closed formulas to express elementary 
axioms, since under the generality interpretation any formula is syn¬ 
onymous with its closure (cf. end §32). Moreover, since the axioms of 
an elementary system are finite in number, we can form the conjunction 
F(e x , ..., e a , Pr x , ..., Pr s ) of the closed formulas expressing the axioms. 
Finally, by changing the individual symbols e 1( ..., e q to respective 
distinct variables z v ..., z Q not occurring in F(e 1( ..., e„, Pr x , ..., Pr,) 
and the predicate symbols Pr 1( ..., Pr s to distinct predicate letters 
P x , .... P 8 , we obtain a predicate letter formula F(z 1( ..., z 9 , P 1( ..., P s ) 
or briefly F. A simple illustration (with q — 0, s = 1) has already been 
given in § 37. 

Expressing the axiom system thus by a predicate letter formula 
F helps to emphasize the standpoint of formal axiomatics (§8), from 
which the set D, the individuals z v ..., z Q and the predicates P v ..., P s 
of the axiomatic theory are undetermined except as the axioms charac¬ 
terize them. Every predicate letter formula can be considered as express¬ 
ing an axiom system with the free variables and predicate letters oc¬ 
curring in it representing the undefined individuals and predicates. 

When we interpret the logical symbols classically, and treat the 
predicates “extensionally” as simply logical functions, the notions of 
set-theoretic predicate logic (§ 37) become applicable to the discussion of 
axiomatic systems. To say that the axioms are satisfied (in the intuitive 
sense, § 8) by some non-empty system of objects (which we also ex¬ 
pressed in § 8 by saying that the axioms are ‘non-vacuous’) now means 
exactly that the formula F is satisfiable (in the set-theoretic sense) in some 
non-empty domain. Predicate letter formulas which are satisfiable but 
not valid are the ones which are of interest as axiom systems. A valid 
predicate letter formula does not restrict or characterize the individuals 
and predicates expressed by its free variables and predicate letters; but 
rather expresses a law of logic applicable to all choices of those individuals 
and predicates in any non-empty domain. 



422 


AXIOM SYSTEMS 


CH. XIV 


In formal axiomatics, without the further step of formalizing the proc¬ 
esses of logical deduction so as to obtain a formal system (§ 15), theorems 
are deduced from the axioms on the basis of the meanings of the logical 
symbols. What it means on this basis for a proposition to be a theorem 
can be expressed in set-theoretic predicate logic as follows. Consider any 
proposition of the axiomatic theory which is expressible by a predicate 
letter formula in ..., P,. We can always choose that formula to 
contain no variables free other than z lt ..., z q , and to contain these only 
free. Now any such formula B(z 1 , ..., z„ P 1( ..P s ) or briefly B ex¬ 
presses a true proposition or theorem of the axiomatic theory, precisely 
if every assignment of individuals z v ..., z Q from some non-empty 
domain D to z v ..., z q and of predicates P v ..., P s over D to P 1( ..., P 4 
which satisfies F also satisfies B. In view of the valuation table for D 
§ 28, this is equivalent to saying that the formula F D B should be valid 
in every non-empty domain. 

Now suppose that the axiomatic theory is formalized (§ 15) by adopting 
the deductive rules of the predicate calculus as the means of deducing 
theorems, under the proviso that the variables z v ..., z q be held constant; 
i.e. we now say proof-theoretically that B expresses a theorem, if in the 
predicate calculus F |- B with z v ..., z q held constant. (By Remarks 
1 and 2 (a) §34, we can then always find a deduction of B from F in 
which no predicate letters other than P v ..., P s occur and z v ..., z„ 
occur only free.) 

By the D-rules, noting that, since F contains no variables free except 
z v ..z q> no variables are varied, this is equivalent to saying that 
|- F 3 B in the predicate calculus. 

We are now in a position to establish that the formalization of deduction 
for elementary axiomatic theories by the predicate calculus is both 
correct (or consistent) and adequate (or complete), i.e. the predicate cal¬ 
culus enables only and all those formulas to be deduced from F which 
express propositions that are true of any system satisfying the axioms. 
For (h FdB} = {FdB is valid in every non-empty domain}, 
by Theorem 21 § 37 and Corollary 1 Theorem 34 § 72. 

The question whether the theorems are consistent with the axioms 
(just answered affirmatively) is of course quite separate from the question 
whether the axioms themselves are consistent. Prior to Hilbert’s proof 
theory or metamathematics, proofs of consistency of an axiomatic 
system or theory were by exhibiting a model for the theory (§ 14). The 
consistency property proved immediately in this case is the satisfiability 
of F in some non-empty domain. 



§75 


SKOLEM S PARADOX, THE NUMBER SEQUENCE 


423 


We gave a heuristic argument in § 14 that this property implies con¬ 
sistency in the sense of non-existence of a contradiction (one theorem 
denying another) in the theory deducible from the axioms. The converse 
that from any unsatisfiable system of axioms a contradiction must 
necessarily follow by a finite number of logical steps was then by no 
means clear. 

It is only with the step taken by the modern formalists of formalizing 
deduction that consistency in the sense of non-deducibility of a contra¬ 
diction becomes amenable to exact discussion. We now nave as the con¬ 
sistency property that for no formula A, both F j- A and F p -i A with 
z v .. .,z t held constant. 

By the -i-rules § 23 (since F contains only z v . .., z, free), this property 
is equivalent to 'not |- -iF’, i.e. to ‘F is irrefutable’. The formalist’s 
transformation of the consistency problem may thus be described (for the 
case of elementary axiom systems) as the replacement of satisfiability 
by irrefutability. 

By Theorem 21 (which takes the place now of the reasoning given in 
§ 14) and Godel’s completeness theorem (Theorem 34), satisfiability 
and irrefutability are equivalent. 

The purpose of the formalistic transformation in the notions of de¬ 
ducibility and consistency is to obtain notions which are finitary. 
A reduction from the non-enumerably to the enumerably infinite is 
achieved, as validity and satisfiability refer to the totality of logical 
functions, which is non-enumerable, while the proof-theoretic equivalents 
provability and irrefutability refer only to the enumerable infinity of 
formal proofs. In metamathematics, the reasoning with the notions is 
also finitary. Although the equivalence proof, as given by Godel’s com¬ 
pleteness theorem, cannot belong to metamathematics, it is significant 
for metamathematics that the set-theoretic notions are actually equiva¬ 
lent to the proof-theoretic ones when one reasons on the non-finitary 
plane to which the set-theoretic notions belong. 

We see now that the decision problem for provability in the pure 
predicate calculus includes the decision problem for provability in every 
axiomatic theory having an elementary axiom system (by asking whether 
a certain predicate letter formula F D B is provable), and also the de¬ 
cision problem whether any given elementary axiom system is consistent 
(by asking whether -iF is unprovable). 

Axiom systems used in mathematics often employ = in the role of 
an ordinary or logical term which must be understood in advance 
rather than as one of the undefined predicates which the axioms charac- 
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terize. The foregoing remarks will apply, if we first supply some additional 
axioms forequality, formalized as Eq(=, Pr 1( ..., Pr s ) or Eq(Q, P x ,..., P„); 
or we can instead formalize the axioms as they stand by an equality and 
predicate letter formula, and deduction from the axioms by the predicate 
calculus with equality, and then use the extension of Godel’s completeness 
theorem to that. The two methods give results which are equivalent via 
Lemma 24 (a) and Theorem 41 (c) § 73, though set-theoretically the former 
does not narrow the interpretations which satisfy the axioms to those in 
which Q is equality but allows Q also to be an equivalence relation. 

Example 1 . The axiom system LI—L3 for linear order (end § 8 ) 
is expressed by the following equality and predicate letter formula 
(call it “F(=, o si)”), with expressing <: 

VaVbVc[^{a,b) & a{b,c) 3 £*(a,c)] & V<*Vj[-i(<3f(<i,£) & a=b) & 

-i (<3{a,b) & <ar( b,a )) &-i(a=b& e2( b,a ))] & VaVb[£l(a,b)Va=b V ^(b,a)]. 
The same axiom system is expressed by the predicate letter formula 
Eq(£, a) & F(B, <£?), with S expressing = (cf. Example 1 §73). 

Our remarks also apply indirectly to axiom systems having functions 
f v ..., f r among their primitive notions, as with the help of = these 
can be replaced by the representing predicates of the functions, as was 
discussed from the proof-theoretic standpoint in Theorem 43 § 74. 

Elementary axiom systems occur frequently in mathematics, if we use 
the term 'elementary’ more widely to include systems which can be 
transformed by well-known devices so as to become elementary in the 
sense formulated at the beginning of the section. For example the axioms 
for groups, and Hilbert’s axioms for geometry with the continuity axiom 
omitted, are elementary in the wider sense. (In the first, the group 
operation can be replaced by its representing predicate, using =; and 
in both, axioms for = can be supplied. An example is worked out in 
Hilbert-Bernays 1934 pp. 3— 8 , 380—381.) 

The foregoing discussion for the case of an elementary axiom system 
can be paralleled for the case of an enumerable infinity of elementary 
axioms as follows. Let F 0 , F lf F 2 , ... be predicate letter formulas express¬ 
ing the respective axioms and containing free only the variables z 0 , z 1( 
z 2 , ... which stand for the undefined individuals of the axiomatic 
theory. Now {B is a "theorem” set-theoretically} = (every assignment 
which satisfies all of F 0 , F 1( F 2 , ... satisfies B} = (for every assignment, 
one of B, -iF 0 , -iF 1( -iF 2 , ... is t} = (some disjunction of a finite 
number of B, —iF 0 , -iF lt -nF 2 , ... is provable} (by Theorem 21 and 
Corollary 1 Theorem 37) 5= {|- F 3 B for some conjunction F of a finite 
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number of F 0 , F x , F 2 , ...} (using *62, *59) = (F 0 , F x , F 2 , ... b B 
with z 0 , z x , z 2 , ... held constant} s- {B is a “theorem” proof-theoreti- 
cally}. Similarly {the axioms are “consistent” set-theoretically} = 
{F 0 , Fj, F 2 , ... are jointly satisfiable} ^ {every conjunction F of a 
finite number of F 0 , F x , F 2 , ... is irrefutable} (by Theorems 21 and 37) ss 
{for every A, not both F 0 , F 1( F 2 , ... f- A and F 0 , F x , F 2 , ... \- -iA, 
with z 0 , z x , z 2 , ... held constant} = {the axioms are "consistent” proof- 
theoretically}. Thus as before the set-theoretic and proof-theoretic 
notions are equivalent. But the former method of reducing the decision 
problems for deducibility from the axioms and for consistency of the 
axioms to that for provability in the predicate calculus fails, since now 
there are quantifications with respect to the finite conjunctions F of 
F 0 , F x , F 2 , ... (however cf. Remark 3 § 76). 

Using the starred forms of Theorems 21 and 37 (cf. Theorem 39 § 73), 
these results extend to the case = is used as a logical notion, and the 
axioms are expressed by equality and predicate letter formulas. 

Axiomatic set theory. The axiom systems for set theory of von 
Neumann * 925 , of Bernays 1937-48 and of Godel 1940 * are elementary 
(in the wider sense). 

As Godel’s axioms are stated, there are three primitive notions, (£13 
(to be a class), - 5 K (to be a set) and £ (to belong to), besides which = is 
used as a logical notion. All sets are classes, and no other objects are 
considered; so that the classes constitute the domain. The axiom system 
can then be expressed by an equality and predicate letter formula 
F(=, <£7, S) where <£?(<*) and S(a, b) express 9J?(a) and a 8 b, respectively, 
or by a predicate letter formula Eq(C, <3, S) & F(C, <C 7 , 2) where C(a, b) 
expresses a=b. 

This axiom system is extremely powerful. From it with appropriate 
definitions the usual classical analysis and much of general set theory can 
be deduced. In particular, the existence of an infinite set is postulated (by 
the axiom of infinity), and also the existence to any set of a set which in¬ 
cludes the subsets of that set; so it is deducible via Cantor’s theorem 
(Theorem C § 5) that there exists a non-enumerably infinite set of sets. 

But by Lowenheim’s theorem (Corollary 2 Theorem 34*, cf. Theorem 
39), if the formula F(=, 2) expressing the axioms is satisfiable at all, 

as it appears to be from its presumed interpretation by set theory, it is 
satisfiable in a finite or enumerably infinite domain. (Examination of 
the axioms rules out the case of a finite domain.) Thus we can interpret 
the primitive notions so that there are only enumerably many sets and the 
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axioms are all true (i.e. an enumerable model exists for axiomatic set 
theory, with = in its usual meaning), even though a theorem in the 
theory asserts that there are non-enumerably many sets. This is the 
Skolem “paradox” (1922-3). 

By the Skolem extension of Lowenheim’s theorem to the case of 
joint satisfiability of an enumerable infinity of formulas F 0 , F 1( F 2 , ... 
(Corollary 2 Theorem 37 *), the “paradox” applies equally to axio- 
matizations of set theory using infinitely many axioms, such as those of 
Fraenkel 1922 and of Skolem 1922-3. 

Light is shed on the “paradox” by two observations. Only those par¬ 
ticular subsets of a given set are definable within the axiomatic theory 
which can be constructed by operations, or separated out from the set by 
properties (i.e. predicates), available in the theory. The basic operations 
for building sets (or processes for constructing predicates) provided by the 
axioms are finite or at most enumerably infinite in number. The iteration 
of them then give the means for defining only enumerably many subsets 
of a given set. This explains the possibility of interpreting the axiom 
system, i.e. of satisfying the formula(s) expressing the axioms, in an 
enumerable domain. 

On the other hand, to enumerate a set is to give a 1-1 correspondence 
of the set with a particular enumerable set, say the set of the natural 
numbers (§ 1). A 1-1 correspondence can be considered as the set of the 
corresponding pairs. 

Thus it may be possible for the subsets of a given infinite set definable 
within the theory to be enumerable from without the theory, and yet 
be non-enumerable within the theory, because no enumerating set of 
corresponding pairs is among the sets definable within the theory. The 
construction of the enumerating set of pairs is accomplished by taking 
into account the structure of the axiom system as a whole, and this 
construction is not possible within the theory, i.e. using only the operations 
provided by the axioms. 

The situation is similar to that in Godel’s incompleteness or un¬ 
decidability theorem (Theorem 28 § 42 ), where, if we suppose the number- 
theoretic formal system to be consistent, we can recognize that A P (p) 
is true by taking into view the structure of that system as a whole, though 
we cannot recognize the truth of Aj,(p) by use only of the principles of 
inference fdrmalized within that system, i.e. not |- Aj,(p). 

Although there is this "explanation”, the “paradox” still confronts 
us with the following alternative. Either we must maintain that the 
concepts of an arbitrary subset of a given set, and of a non-enumerable 
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set, are a priori concepts which elude characterization by any finite or 
enumerably infinite system of elementary axioms; or else (if we stick to 
what can be explicitly characterized by elementary axioms, as we may 
well wish to in consequence of the set-theoretic paradoxes § 11) we must 
accept the set-theoretic concepts, in particular that of non-enumerability, 
as being relative, so that a set which is non-enumerable in a given 
axiomatization may become enumerable in another, and no absolute 
non-enumerability exists. This relativization of set theory was proposed 
bv Skolem (1922-3, 1929, 1929-30). 

The Lowenheim theorem, since it leads to Skolem’s “paradox”, 
can be regarded as the first of the modern incompleteness theorems. 
For further discussion, see Skolem 1938. 

Axiomatic arithmetic. Postulate Group B of our formal number- 
theoretic system provides an example of an axiom system for the theory 
of the natural numbers consisting of an effectively enumerable infinity 
of elementary axioms, i.e. the formulas expressing the axioms are ef¬ 
fectively enumerable (cf. Theorem 38 § 72 ). The functions are of course 
replaceable by their representing predicates. Ryll-Nardzewski 1952* 
shows that no finite subset of these axioms would suffice for the deduction 
of the same class of theorems. 

Another question is whether these axioms do completely characterize 
the natural number sequence. Godel’s completeness theory for the 
predicate calculus provides us with a proof of the following theorem, 
which was originally obtained in another way by Skolem (1933, 1934; 
cf. 1938). 

We shall consider axioms for the sequence of the natural numbers 
(call the set of them N), using as primitive notions the individual 0 and 
the predicate a'=b, i.e. in the X-notation (§ 10 ) \ab a'=b, and perhaps 
other primitive notions. The axioms shall be expressible by equality and 
predicate letter formulas, with z expressing 0 and P 0 (a, b) expressing 
a'—b. 

In discussing assignments to the free variables and predicate letters 
of any such formula, we let “D” stand for the domain, “z” for the in¬ 
dividual assigned to z, and “P 0 (a, b)” for the predicate assigned to P 0 (a, b). 
Then (D, z, P 0 {a, b)) is a mathematical system in the sense of § 8, con¬ 
sisting of a set or domain, a member of the set, and a binary predicate 
over the set. 

Theorem 44 c . Any finite or effectively enumerable infinite class of 
equality and predicate letter formulas which can be jointly satisfied so that 
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(D, z, P 0 (a, b )) is ( N , 0, a'=b) can also be jointly satisfied (with 0 < D < H 0 ) 
so that ( D, z, P 0 (a, b)) is not isomorphic to ( N, 0, a'—b). 

Proof. By hypothesis, there is a satisfying assignment for the given 
formulas jointly in which the domain D is N, z has the value 0, and 
P 0 {a, b) has the value a'—b. 

Let P 1( P a and P 3 be other distinct predicate letters, which either do 
not occur in the given formulas, or have the respective values a-\~b—c r 
a-b—c and ( x)(Ey)T z (a , a, x, y) in the given assignment. We shall extend 
the given class of formulas by adding seven formulas (if not already- 
included) which are satisfied when the given assignment is extended 
(if necessary) by assigning P x , P 2 and P 3 the values just mentioned. 

We add four closed formulas, say 

,, VaP 1 (a,z,a), VaVb3c3d3e[P 0 (b,c) & P 1 (a,c,d) & P^b.e) & P 0 (e,d)], 
VaP 2 (a,z,z), VaVb3c3d3e[P 0 (b,c) & P 2 (a,c,d) & P 2 (a,b,e) & P 1 (e,a,d)] 
(cf. Example 11 (a) §74), which under the described assignment express 
the recursion equations for a-\-b and a-b as paraphrased in terms of the 
representing predicates a+b=c and a-b—c, together with the two 
formulas 

(b) VaVb3!cP 1 (a, b, c), VaVb3!cP 2 (a, b, c), 

which express that a-\-b—c and a-b—c are representing predicates. 
Since T 2 (a, b, x, y) is primitive recursive, by Corollary Theorem I § 49 
it is arithmetical (§ 48), and so (replacing ', + and • by their representing 
predicates) we can find a letter formula T a (a, b, x, y) in =, P 0 , P 1( P 2 
which expresses it under the described assignment. We then add the 
formula 

(c) Va(P 3 (a) ~ Vx3yT a (a, a, x, y)). 

Let the formulas of the resulting (enlarged) class be F 0 , F lt F a , .... 
As in the proof of Lowenheim’s theorem (Corollary 2 Theorem 37*), 
the hypothesis of Theorem 37* is satisfied; and we use Theorems 37* and 
40 to obtain another satisfying assignment for F 0 , F x , F a , ... jointly. 
Let £)*, z*, P*, P*, P*, P* be respectively the domain and the values 
of z, P 0 , P x , P a , P 3 in this. 

Suppose (for reductio ad absurdum) that ( D *, z*, P*{a*, b*)) is 
isomorphic (§ 8) to (N, 0, a'=b), i.e. D* is infinite and can be enumerated 
as s 0 , s 1( s 2 , .•.. so that z* = s 0 and P*{s a , s b ) = a'—b. 

Then by Theorem 40, 

(i) P*(s a ) = (Ex) (y)R*(a, x, y) 

for some primitive recursive R*. 
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In § 43 we reasoned that when the variables range over the natural 
numbers, and 0 and ' have their usual meanings, the recursion equations 
for + and • have the usual functions + and • as their unique solution. 
Since the formulas (a) and (b) are satisfied, that reasoning (with minor 
rearrangements to fit the use of the representing predicates instead of 
the functions) applies now to show that P* ( s a , s b , s c ) = a+b=c and 
P*(s a , s b , s c ) = a-b—c. Then similarly our proof of Theorem I § 49 
shows that the formula T 2 (a, b, x, y) now expresses a predicate 
T*(a*, b*, x*, y*) such that T%{s a , s b , s x , s v ) = T t {a, b, x, y). Hence 
Vx3yT„(a, a, x, y) expresses a predicate T*(a*) such that 

(ii) T*(s a ) = (x)(Ey)T 2 (a, a, x, y). 

By the valuation rules for ~ (Example 1 § 28) and V, since (c) is 
satisfied, 

(iii) P*{a*) = T*{a*). 

Combining (i) — (iii), (Ex)(y)R$(a, x, y) = (x)(Ey)T 2 (a, a, x, y). But by 
Theorem V (16) §57, the predicate (x)(Ey)T 2 (a, a, x, y) is not expressible 
in the other 2 -quantifier form; for a certain number g, 

( Ex){y)R*{g,x,y ) =$= (x){Ey)T 2 {g, g, x, y). 

By reductio ad absurdum, (D*, z*, P*(a*, b*)) is not isomorphic to 
{N, 0 , a'=b). 

Discussion. By the theorem, no finite or effectively enumerable 
infinite set of elementary axioms can characterize the natural number 
sequence 0, 1,2, ..., a, a', .... Any such set which are true of the natural 
number sequence must also be true under another interpretation. We 
stated the theorem for the natural number sequence as a system of the 
form ( N, 0, a’—b), but by the replaceability of a' by a'=b it applies 
also to the natural number sequence as a system of the form ( N , 0 , '). 
In particular, the axioms of Postulate Group B of our formal number- 
theoretic system (§ 19) admit an interpretation (using the logical symbols 
and = in their usual meanings) other than the intended one. 

This incompleteness of Postulate Group B as a characterization of 
the natural number sequence is understandable when we compare 
Peano’s fifth axiom (the principle of mathematical induction, § 7) with 
Axiom Schema 13. Peano’s fifth axiom asserts that 
(I) A(0)&(x)(A(x)-+A(x'))^(x)A(x) 

holds for all number-theoretic predicates A (x). These predicates constitute 
a non-enumerable totality. But the bundle of axioms given by Axiom 
Schema 13 only express that (I) holds for those predicates A{x) which 
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are expressible by formulas A(x) of the system, i.e. only for enumerably 
many predicates. Peano’s fifth axiom is not elementary. We can express 
it in the symbolism of the second order predicate calculus (§ 37), by using 
a generality quantifier with a predicate variable <77, thus: 

V<77[<3(0) & V*(<37(x) 3 a{x')) 3 Vx«3f(*)]. 

These ideas have a connection with Godel’s theorem on formally 
undecidable propositions (Theorem 28 or 29 § 42). 

One may think of the formula A p (p) or A q (q) (which is true but un- 
provable, if the number-theoretic system is simply consistent) as ex¬ 
pressing a proposition which can be "proved” from Peano’s axioms, 
but only by making use of induction with some induction predicate 
A{x) which is not expressible in the system under the intended inter¬ 
pretation. (This suggestion will receive confirmation later; cf. end §79, 
noting (II) § 42.) 

The unprovability of A v (p) becomes understandable also from Skolem’s 
result (Theorem 44), on the ground that A „(p), although true of the nat¬ 
ural numbers, is false under one of the other interpretations which 
satisfy the axioms. Then the undecidability of A P (p) in the number- 
theoretic formalism appears as a phenomenon of the same kind as the 
impossibility of proving either Euclid’s parallel postulate or its negation 
from the other axioms of geometry (§ 8 ). The given axioms are not 
categorical. 

Indeed conversely, as remarked above (using Corollary 1 Theorem 37*), 
a formula is provable, if it is true under all interpretations which make 
the axioms true. So the known unprovability of A „(p) makes it absurd 
that A „(p) should be true under all the interpretations which satisfy the 
axioms. Thus Godel’s theorems 28 and 37* afford another proof of 
Theorem 44 for the case the class of formulas for Theorem 44 is Postulate 
Group B of our formal system (restated as equality and predicate letter 
formulas). But (as remarked above, using Theorem 21 *), if any enumer¬ 
able class of equality and predicate letter formulas are jointly satisfiable, 
the formal system obtained by adjoining them as axioms to the predicate 
calculus with equality is (simply) consistent. So given any class of 
formulas for Theorem 44, by adjoining Postulate Group B, and carrying 
out the proof of Theorem 28 in the resulting system (or using Theorem 
XIII Part III § 60), we get Theorem 44 in general. 

Mostowski 1949 gives an interesting example (suggested by Skolem’s 
"paradox”) of a proposition in axiomatic set theory, which he demon¬ 
strates to be undecidable by showing it to be true under one interpretation 
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and false under another. Kreisel 1950 deals with similar problems. 

The proof of Theorem 44 we gave first, and that based on Theorem 28 
(or XIII), are stated for the case of an elementary axiom system or one 
having an effectively enumerable infinity of elementary axioms. Skolem’s 
proof does not restrict the enumeration of the axioms to be effective. 
The additional generality is not essential, when we are considering the 
theorem as an incompleteness theorem for formal axiomatics from the 
standpoint that the aim of axiomatization is to make the assumptions of 
the theory explicit. 

Remark 1 . Our (first) proof of Theorem 44 can be modified to secure 
additional generality. Suppose now the class of the formulas for Theorem 
44 is merely arithmetical, in the sense that, under a Godel numbering 
established by the methods of §§ 52 and 56, the predicate 'x is the Godel 
number of a formula of the class', call it "C(x)”, is arithmetical. Then 
by Theorem VII (d) § 57, C(x) is expressible in one of the forms of 
Theorem V, say a ^-quantifier form. Let B(x) refer similarly to the given 
class of formulas extended by adding some finite list (as above we added 
the seven (a) — (c)); then, however that list is chosen, B(x) is expres¬ 
sible in the* same ^-quantifier form. By Theorem XIV (b) §60 (taking 
R(x, y) = B(x)), the class xB(x) is enumerated by a function 0(A) re¬ 
cursive in ^-quantifier predicates (in the case of the ^-quantifier form with 
existence first, even in q —\ -quantifier predicates). But the proofs of 
Theorems 38 and 40 hold good, when the hypothesis that the enumeration 
F 0 , F 1( F 2 , ... is effective is omitted, and the conclusion is altered by 
changing "primitive recursive” to "primitive recursive in 0 ” where 
0 (A) is the Godel number of F*; and hence (using Theorem XI § 58, and 
(17) and (18) §57) if 0 is recursive in ^-quantifier predicates, when the 
alteration in the conclusion consists in substituting the q-\- 2 - for the 
2 -quantifier forms. So the proof of Theorem 44 goes through now, by 
using a case of Theorem V for a q+2- instead of a 2 -quantifier form. — 
We can still further generalize the C{x) for Theorem 44 to be arithmetical 
in the predicate M of Theorem VIII § 57, including among the added 
formulas three to express the definition of M, and using Theorems I*, V*, 
VII*, XI* (with M as the Y) instead of I, V, VII, XI. (The second proof 
of Theorem 44 can also be carried out under more general hypotheses, 
by using generalizations of Theorem 28 or XIII to "non-constructive 
logics”.) 

Godel’s undecidability theorem however is not restricted to the case 
the axioms are elementary (Theorem XIII § 60). But we can characterize 
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the natural number sequence completely by Peano’s axioms, the fifth 
of which is non-elementary, if we grant the notion of all predicates over 
the domain. Suppose we have a consistent formal system containing 
these categorical axioms for the natural numbers. Under the conception 
of a formal system which we are entertaining, a formal system may 
have only an enumerable infinity of formal objects. So only an enumerable 
infinity of formulas can be substituted in the system for the predicate 
variable < 3 ? of the fifth Peano axiom. Thus for deductive purposes within 
the system, just as before, (I) is available only for an enumerable infinity 
of predicates. In fact, by Godel’s theorem (Theorem XIII), there is a 
consequence of the axioms under the interpretation which is not provable, 
i.e. not deducible from the axioms by the logic formalized in the system. 
So when we have non-elementary axioms, not all formulas need be prov¬ 
able which are true under all interpretations which satisfy the axioms. 
(In our example, there is essentially just one such interpretation.) The 
incompleteness which appeared in the axiom system in the case of 
elementary axioms is transferred to the deductive apparatus, if we 
undertake to avoid it by using non-elementary axioms. 

As Skolem expresses it (1934 p. 160 ), "... the [natural number] 
series is completely characterized, for example, by the Peano axioms, 
if one regards the notion 'set’ or ‘propositional function’ as something 
given in advance with an absolute meaning independent of all principles 
of generation or axioms. But if one would make the axiomatics conse¬ 
quent, so that also the reasoning with the sets or propositional functions 
is axiomatized, then, as we have seen, the unique or complete character¬ 
ization of the number series is impossible.” 

This situation is discussed in Henkin 1950, which came to the author’s 
attention after this section was written (the first draft in 1947 ). Other 
papers are e.g. Mostowski 1947a (cf. Kemeny’s review 1948) and Rosser 
and Wang 1950 (cf. Skolem’s review 1951). 

§ 76. The decision problem. Theorem 54. The decision problem 
for the pure predicate calculus [pure predicate calculus with equality) is 
unsolvable, i.e. there is no decision procedure for determining whether a 
Predicate letter formula {an equality and predicate letter formula ) is provable 
in the calculus. (Church 1936a, Turing 1936-7.) 

Proof, for the predicate calculus. We saw in § 75 that the decision 
problem for provability in any axiomatic theory having an elementary 
axiom system reduces to that for provability in the pure predicate cal- 
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cuius. Thus to prove this theorem of Church, it will suffice to find an 
elementary axiomatic theory for which, on the basis of Church’s thesis 
(§ 60), there can be no decision procedure. 

According to Theorem 33 §61, an example of such an axiomatic 
theory is provided by the formal system of Robinson described in Lemma 
18b § 49, if that system is simply consistent. A proof of its simple con¬ 
sistency, which the present theorem is numbered to follow, will be given 
in § 79 (Theorem 53 (a)). 

We repeat the reasoning in detail (already outlined in §75). Let S 2 
be the system of Lemma 18b with thirteen particular axioms. 

(A) By applications of Theorem 43 (see Example 11 (a) §74), we 
find another system S x in which the function symbols ' are replaced 
by respective predicate symbols, the number of the axioms being in¬ 
creased to nineteen. By (Villa), a formula E of S 2 is provable in S 2 , if 
and only if the formula E' of Si is provable in S v 

(B) We can then replace the axioms by their closures, without 
changing the provability notion (end § 32). 

(C) Moreover, by the &-rules, since the axioms are finite in number, 
we can likewise replace them by their conjunction as a single axiom. 

(D) We can furthermore change the notation to employ a variable z 
not occurring in the axiom(s) in place of the individual symbol 0 , with the 
understanding that then provability shall mean deducibility from the 
axiom(s) in the predicate calculus with z held constant. If C be the result 
of making this change in a formula D of S t not containing z, then by 
Remark 2 (b) § 34, C is provable now, if and only if D was provable 
before. (This treatment of 0 separately from the function symbols can 
be avoided by using instead Example 11 (b) § 74 in (A).) 

(E) We may further change the notation to employ predicate letters 
in place of the predicate symbols. If B results from a formula C by this 
change of notation, then by trivial applications of Theorems 15 and 16 
§ 34, B is now provable, if and only if C was before. (In § 75 we took 
B to contain no variables free except z, but that was done to make the 
set-theoretic notions apply properly, and is unnecessary now.) 

(F) Finally, by the 3-rules, B is provable in the last described 
system, i.e. F f- B in the predicate calculus with z held constant, where F 
is the formula (containing only z free) expressing the axioms now, if 
and only if F 3 B is provable in the predicate calculus. 
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The whole process by which from the formula E of S 2 we find a pred¬ 
icate letter formula F D B such that {f- E in S 2 } = {|-FdB in the 
predicate calculus} is effective. (It could be represented via Godel number¬ 
ing by a general recursive function as discussed in § 61, and it could be 
effected by a Turing machine as at end § 70.) Therefore, if there were a 
decision procedure for provability in the pure predicate calculus, there 
would be one for S 2 , which would consist, given a formula E of S 2 , in 
finding the corresponding predicate letter formula F 3 B, and applying 
the procedure for the predicate calculus to the latter. But by Theorem 33, 
if S 2 is simply consistent, there is no decision procedure for provability 
in S 2 . 

The argument applies also to the predicate calculus with equality, 
since by Theorem 41 (b) provability in the system of Lemma 18b is 
equivalent to provability in a system consisting of the predicate cal¬ 
culus with equality and seven particular axioms. 

Remark 1 . The order in which the reduction steps (A) — (F) are 
performed is immaterial, so long as (B) and (C) precede (F), (A) precedes 
(B), (C) and (E), and if (D) precedes (B) the variable z is exempted from 
the closure operation of (B). (Or (C) may be omitted, and (F) per¬ 
formed once for each axiom; cf. *4 and *5 § 26.) 

Remark 2 . The proof of Theorem 54 may be based on Theorem XII 
§ 60 instead of Theorem 33, thus. By the foregoing reductions (A) — (F) 
on the system of Lemma 18b with Example 2 § 60, or by (the method of) 
those reductions on the system of Example 2 § 73, and using the con¬ 
sistency property to be established in § 79 (Theorem 53 (b) or Theorem 
52): For any fixed primitive (or general) recursive predicate R(x, y), there 
is an effective procedure by which, given any number x, a predicate letter 
formula K x can be found such that 

(1) (Ey)R(x,y) = {\- K x in the predicate calculus). 

Theorem 54 then follows from Theorem XII by taking R(x, y) = T^x, x, y). 
This proof from Theorem XII with Example 2 § 73 is essentially Church’s 
original proof; that from Theorem 33 essentially Mostowski and Tarski’s 
(1949 abstract). 

Remark 3. Our conception of a formal system S implies that the 
formulas of S should be effectively enumerable, or admit a Godel number¬ 
ing, so that given any formula A we can effectively find its number x, 
and inversely given any number x we can effectively decide whether it 
is the Godel number of a formula and if so find that formula A,.. Then 
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(cf. Remark 1 (a) § 60): There is a general recursive predicate R such that 
( 2 ) {f- A* in S} = ( Ey)R(x,y ). 

So the decision problem for provability in any system S is equivalent 
to that for a predicate of the form ( Ey)R(x, y). By Remark 2, or by the 
(first) proof of Theorem 54 with the latter part of Example 3 § 61, the 
decision problem for the predicate calculus is of the highest degree of 
unsolvability for predicates of this form (cf. preceding Example 3 § 61). 
Thus (combining ( 1 ) and (2)), the decision problem for provability in 
any formal system is reduced to that for the predicate calculus (either 
classical or intuitionistic). This generalizes our remark (§ 75) that the 
problem for any axiomatic theory with an elementary axiom system re¬ 
duces to it; but of course the reductions by going out of the system to a 
Godel numbering via ( 2 ), and thence into the predicate calculus via ( 1 ) 
are very indirect. 

By §§ 37, 72, 73, 75, provability in the predicate calculus with equality 
is equivalent to validity in every non-empty domain; so now there is 
no decision procedure for the latter property of an equality and predicate 
letter formula. Trahtenbrot 1950 proves the analogous theorem for validity 
in every non-empty finite domain. 

Reductions and special cases. Because so many particular 
questions (e.g. Fermat’s “last theorem’’ § 13) and decision problems 
reduce to the decision problem for the predicate calculus, much work 
has been done on it, leading to positive results of two sorts: (a) reductions 
of the general problem, and ((3) solutions of special cases. The results are 
often presented in a dual set-theoretic form, in which the problem is to 
decide as to the satisfiability of a predicate letter formula in some non¬ 
empty domain (§§ 72, 75), rather than as to its provability. 

An early example of (a) is Skolem’s normal form (Skolem 1920 , Hil- 
bert-Bernays 1934 pp. 158 ff). The Skolem proof-theoretic (satisfaction- 
theoretic) normal form is a prenex formula (Theorem 19 §35) in which 
all the existential (generality) quantifiers come first. Given a predicate 
letter formula G, there can be found effectively a predicate letter formula 
M (N) of this form, such that M is provable in the predicate calculus 
(N is satisfiable in a given domain), if and only if G is. Thus the decision 
problem for provability (satisfiability) for predicate letter formulas 
generally is reduced to the same problem for Skolem normal forms 
(cf. the latter part of § 61). The normal form M (N) is not in general 
equivalent to G, but M (~iN) is interdeducible with G (- 1 G) in the 
predicate calculus with a postulated substitution rule (§37). Skolem 



436 


AXIOM SYSTEMS 


CH. XIV 


used his satisfaction-theoretic normal form in simplifying the proof of 
Lowenheim’s theorem and generalizing it, and Hilbert and Bernays 1939 
employ it in proving Godel’s completeness theorem in a way which makes 
the formalization referred to in the proof of Theorem 36 § 72 reasonably 
simple. 

An example of ((S) is the solution of the decision problem by Lowenheim 
1915 (simplified by Skolem 1919), and independently by Behmann 1922, 
for the case of predicate letter formulas containing only predicate letters 
with 0 or 1 argument. Equivalently, by Remark 1 § 34 , the decision prob¬ 
lem is solved for the 1 -place predicate calculus, i.e. the calculus with 
only 0- and 1-place predicate letters. (Cf. end § 72 , Hilbert-Bernays 1934 
pp. 179 — 209 .) 

Reductions and special cases of the decision problem have remained 
an active field of research, since Church showed that there can be no 
general solution (Theorem 54 ). The literature is too extensive to be cited 
here, and the reader is referred to Church’s bibliography and the review 
sections of the Journal of Symbolic Logic (cf. the preface to the bib¬ 
liography of this book). Quite a number of the results are described 
in Hilbert-Bernays 1934 and 1939. Church 1951 discusses special cases. 

Axiomatic theories. We take up now a method of Tarski (1949 
abstract) for investigating the decision problems for axiomatic theories. 
We shall consider theories formalized on the basis of a logical calculus, 
which may be either the predicate calculus or the predicate calculus with 
equality. (Tarski uses the latter.) We shall usually say “formal system S” 
for uniformity with our previous terminology, where Tarski says “theory 
X” which emphasizes the mathematical application. 

By the logical constants we shall mean the six logical symbols 3 , &, 
V, -1, V, 3 , if the logical calculus is the predicate calculus; these and 
also —, if it is the predicate calculus with equality. The terms and formulas 
of a system are to be constructed using besides these logical constants 
a finite number of individual, function and predicate symbols, called the 
non-logical constants (but no predicate letters). The postulates besides 
those of the logical calculus shall be a finite or infinite set of non-logical 
axioms. 

Following Tarski, we call such a system finitely axiomatizable , if the 
non-logical axioms are finite in number or all but some finite set of 
them are redundant (Example 2 § 74 ). We say of such systems that S 2 
is an extension of S x (or is a subsystem of S 2 ), if each formula 
provable in is provable in S 2 ; S 2 must then have all the non-logical 
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constants of S v but it may have others in addition. An extension S 2 of S x 
is a finite extension, if all but a finite number of the axioms of S 2 are 
provable in S v We say briefly that S : is undecidable to mean that the 
decision problem for provability in Si is unsolvable. 

Following Tarski, we say that S is essentially undecidable, if S is (simply) 
consistent, and every (simply) consistent extension of S is undecidable. 

Rosser 1936 showed that systems like our number-theoretic system of 
Chapter IV (if they are consistent) have this property (cf. Theorem 33 
§61). Then the formalized systems of axiomatic set theory of von Neu¬ 
mann 1925, of Bernays 1937-48 and of Godel 1940 (if consistent) are 
examples of systems S which are both essentially undecidable (since they 
include the usual number-theory) and finitely axiomatizable. Mostowski 
and Tarski (1949 abstract) were the first to note the existence of a system 
S which is both essentially undecidable and finitely axiomatizable, 
and also simple enough to be easily interpretable in various other theories, 
in the sense to be defined next. This provides the basis for the application 
of the method of Tarski which is given in Theorem 45 (b) and (c). A still 
simpler example of an essentially undecidable and finitely axiomatizable 
system is that of Raphael Robinson (1950 abstract), which has thirteen non- 
logical axioms as described in Lemma 18b § 49 on the basis of the predicate 
calculus, or seven only (Axs. 14, 15, 18—21, and the formula of *137, 
or equivalently of *136) as it was described by Robinson on the basis 
of the predicate calculus with equality. Robinson states that none of 
these seven can be omitted without sacrificing the essential undecidability. 

Tarski says that two systems S x and S 2 are compatible, if they have 
the same non-logical constants and a (simply) consistent common 
extension. Now consider any two systems S 1 and S 2 which in general 
do not have the same non-logical constants. First we take the case 
the logic is the predicate calculus with equality. Then S 2 is consistently 
interpretable in S v if S x and S 2 have a consistent common extension S 3 , 
in which there is provable, for each »-place predicate symbol P (function 
symbol f) of S 2 which S x lacks, a formula having the form P(x 1( ..., x n ) ~ 
F(x x , ..., x„) of an explicit definition of P (the form f(x x , .,x„) = w 
~ F(x x , ...,x„, w), i.e. (iii) of Lemma 26 §74) where the variables 
shown are distinct and F^, ...,x„) (F(x 1 , ...,x„, w)) contains only 
these variables free and as non-logical constants only ones of S x and 
possibly additional individual symbols. For the case the logical calculus 
is the predicate calculus and S 2 has function symbols which S x lacks, 
furthermore S x shall have = among its constants, and in S 3 there shall 
be provable the equality axioms for the predicate and function symbols 
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of S x . The situation is illustrated (rather trivially) by the S v S 2 and S 3 
in the proof of Theorem 43 § 74 (but either for (a), or for (b) assuming 
consistency). 

Theorem 45. (a) If S is undecidable, then every system S x which lacks 

none of the constants of S except possibly individual symbols and of which 
S is a finite extension is undecidable. 

(b) If S is essentially undecidable and finitely axiomatizable, then every 
system S x which lacks none of the constants of S except possibly individual 
symbols and which has a consistent common extension S 3 with S {in particular 
every system S x compatible with S) is undecidable . 

(c) If S is essentially undecidable and finitely axiomatizable, then every 
system S x in which S is consistently interpretable is undecidable. (Tarski 
1949 abstract.) 

Proofs, (a) By the reductions (B), (C), (D) applied only to the 
individual symbols of S which S x lacks, and (F). (Tarski takes the axioms 
to be closed ab initio, and deals with the case that S x and S have the same 
constants; then (B) and (D) are not required.) 

(b) Let S 2 be the system having the axioms (and constants) of both 
S x and S. Then S 2 is a subsystem of S 3 ; and hence, since S 3 is con¬ 
sistent, so is S 2 . Also S 2 is an extension of S; and hence, since S is es¬ 
sentially undecidable and S 2 is consistent, S 2 is undecidable. But S 2 
is a finite extension of S x . Now (a) applies with S 2 as its S. 

(c) We shall treat in detail the case S has a function symbol f as its 
only constant (except perhaps individual symbols) which S x lacks, for 
the predicate calculus as the logic. If S has more such function symbols, 
we merely iterate the application of Theorem 42 and its lemmas; and if 
5 has such predicate symbols, we use Example 1 § 74 likewise. The results 
will then hold also for the predicate calculus with equality, as (by 
Theorem 41 (b)) extending the logic to that is equivalent to assuming 
that the equality axioms for all the function and predicate symbols 
are present in all the systems considered (which only makes the argument 
easier). 

Let S 3 be the common extension of S x and S described in the definition 
of consistent interpretability (with S as the S 2 ). 

Let- S 4a be the subsystem of S 3 having as its constants those of S x , 
f, and the additional individual symbols (if any) belonging to S or 
occurring in F(x x , ..x n , w), and having as its non-logical axioms those 
of S x and of S, the equality axioms for the predicate and function 
symbols of S x , and (iii). 
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Since S 4a is a subsystem of S 3 and S 3 is consistent, S 4a is consistent. 
Hence, since S 4a is an extension of S and S is essentially undecidable, 
S 4a is undecidable. 

Using Lemmas 26 and 27 and Remark 2 (a) § 74 , in the list of 
non-logical axioms for S 4a we can replace each one A of the non-logical 
axioms of S by its principal f-less transform A', and (iii) by (i) and 
(ii), without changing the class of the provable formulas; call the re¬ 
sulting undecidable system S 4 . 

By Theorem 42 § 74, the function symbol f and its axiom (ii) can 
be eliminated from S 4> leaving a system S 6 which (by (IV) § 74 ) is also 
undecidable. 

But S 5 has as its constants only those of S x and possibly individual 
symbols, and S 6 is a finite extension of S v So by (a) (with S 6 as its S), 
S 4 is undecidable. 

Remark 4 . For the conditions on S 3 in the definition of consistent 
interpretability, instead of (iii) we may have provable in S 3 a formula 
having the form f(x 4 , ..., x n )=t(x 4 , ..., x„) of an explicit definition of 
f, where t(x 4 , ..., x n ) is a term containing only the variables shown and 
as non-logical constants only function symbols of except possibly 
individual symbols. For then t(x 4 , ..., x„)=w is an F(x 4 , ...,x„, w). 

Example 1 . By (b), since Robinson’s system, call it S, is essentially 
undecidable ar i finitely axiomatizable, every formal system S x with the 
constants =, ', +, • (and perhaps others, e.g. 0), the provable sentences 
of which express true propositions about natural numbers, is undecidable, 
if we accept the simple consistency of a common extension S 3 (say that 
one which has the axioms and constants of bo.h and 5 ) as guaranteed 
by the truth. To make the undecidability of such a system S 1 a meta- 
mathematical result, it remains to supply a metamathematical consistency 
proof for the S 3 . — By (c) and Remark 4 , since ' is definable explicitly 
from 1 as an individual symbol and +, the same holds for such systems 
with the constants =, +, • (at least). 

Starting from Mostowski and Tarski’s example of a finitely axioma¬ 
tizable and essentially undecidable system, Mostowski and Tarski (1949 
abstract), Tarski (1949a, 1949b abstracts), Julia Robinson (1949 abstract, 
1949) and Raphael Robinson (1949 abstract) obtain in rapid succession 
the undecidability of a variety of mathematical theories in the arithmetic 
of integers and rationals, rings, groups, fields, lattices and projective 
geometries. 
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CONSISTENCY, CLASSICAL AND INTUITIONISTIC SYSTEMS 

§ 77 . Gentzen’s formal system. In Example 2 § 73 we found what 
may be described as a direct way of deducing A(x) from the axioms 
of S in the predicate calculus when (Ey)R(x, y). This -direct way can 
lead from the axioms to A(x) only when ( Ey)R{x,y ). To establish the 
consistency property, i.e. that A(x) is deducible only when ( Ey)R(x , y), 
what we must do is to show that a roundabout way of proceeding in 
the predicate calculus can lead from the axioms to A(x) only when the 
direct way does. In the formalism of recursive functions, the corresponding 
consistency problem was trivial (§ 54 , Example 3 § 60 ), precisely because 
no other than the direct way of proceeding from the assumption formulas 
was allowed by the rules of the system. This leads us to inquire whether 
there may not be a theorem about the predicate calculus asserting that, if 
a formula is provable (or deducible from other formulas), it is provable 
(or deducible) in a certain direct fashion; in other wards, a theorem giving 
a normal form for proofs and deductions, the proofs and deductions in 
normal form being in some sense direct. 

A theorem of this sort was obtained by Gentzen 1934-5*. We shall 
present it in § 78 , and apply it in § 79 to obtain the consistency results 
referred to in Example 2 § 60 and Example 2 § 73 (and used in § 76 ), 
as well as the consistency of number theory with the restricted rule of 
induction (mentioned at the beginning of § 42). These consistency proofs 
can be given by other methods, as by Ackermann 1924-5, von Neumann 
1927 and Herbrand 1930, 1931-2. All are somewhat long. Gentzen’s is 
one of the easiest to follow, as the proof of his “Hauptsatz” or normal 
form theorem (of which it mainly consists) breaks down into a list of 
cases, each of which is simple to handle. Another application of this 
theorem is given in § 80 . Except incidentally, §§ 81 and 82 are independent 
of §§ 77 — 80 . 

Gentzen’s normal form for proofs in the predicate calculus requires 
a different classification of the deductive steps than is given by the 
postulates of the formal system of predicate calculus of Chapter IV 
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(§ 19). The implication symbol D has to be separated in its role of 
mediating inferences from its role as a component symbol of the formula 
being proved. In the former role it will be replaced by a new formal 
symbol -»■ (read “gives” or “entails”), to which properties will be 
assigned similar to those of the informal symbol “ in our former 
derived rules. 

Gentzen’s classification of the deductive operations is made explicit 
by setting up a new formal system of the predicate calculus. The formal 
system of propositional and predicate calculus studied previously 
(Chapters IV ff.) we call now a Hilbert-type system, and denote by H. 
Precisely, H denotes any one or a particular one of several systems, 
according to whether we are considering propositional calculus or pred¬ 
icate calculus, in the classical or the intuitionistic version (§ 23), and 
according to the sense in which we are using ‘term’ and ‘formula’ (§§ 17, 
25, 31, 37, 72—76). The same respective choices will apply to the Gentzen- 
type system Gl which we introduce now and the G2, G3 and G3a later. 

The transformation or deductive rules of Gl will apply to objects 
which are not formulas of the system H, but are built from them by an 
additional formation rule, so we use a new term ‘sequent’ for these 
objects. (Gentzen says “Sequenz”, which we translate as “sequent”, 
because we have already used “sequence” for any succession of objects, 
where the German is “Folge”.) A sequent is a formal expression of the 
form Aj,..., A, -*■ B x , ..., B m where l, m ;> 0 and A x , ..., A,, B 1( ..., B m 
are formulas. The part A 1 , ..., A t is the antecedent, and B x , ..., B m 
the succedent of the sequent A x , .... A t -*B 1 , ..., B m . 

When /,*»;> 1, the sequent A x , ..., A t —*B 1( ..., B m has the same 
interpretation for Gl as the formula A x & ... & A t 3 B x V ... V B m for H. 
The interpretation extends to the cases with l — 0 or m = 0 by 
regarding A x & ... & A { for Z = 0 (the “empty conjunction”) as true and 
B x V ... V B m for m — 0 (the “empty disjunction”) as false. 

A formula occurs in (or belongs to) a sequent A x , ..., A t —*B X , ..., B m , 
if it is one of the l-\-m occurrences of formulas A x , .... A,, B x , ..., B m ; 
and similarly for occurrence of a formula in antecedent or succedent. 
For example, Gl and Gl & B but not 8 occur in the sequent G4, Gl &B —*■ Gl. 
A variable (symbol, quantifier, etc.) occurs in a sequent, etc., if it occurs 
in some formula of the same. 

As in Chapter V, we use Greek capitals ‘T”, “A”, “0”, “A”, etc. 
to stand for finite sequences of zero or more formulas, but now also as 
antecedent (succedent), or parts of antecedent (succedent), with sep¬ 
arating formal commas included. 
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Postulates for the formal system G 1 


Stipulations: A, B, C, D are formulas; I\ A, 0, A are finite sequences 
of zero or more formulas; x is a variable; A(x) is a formula; t is a term 
free for x in A(x); and b is a variable free for x in A(x) and (unless b is x) 
not occurring free in A(x). 

Restriction on variables (for two of the postulates as indicated): 
The variable b of the postulate shall not occur free in its conclusion. 
(When the A(x) does not contain the x free, then A(b) is A(x) no matter 
what variable b is; we agree in such a case to choose for the analysis 
a b not occurring free in the conclusion, so that the restriction is met.) 

The difference between the classical and intuitionistic systems G\ 
is secured by the intuitionistic restriction stated for two of the postulates. 

Axiom schema. 

C-»C. 


Logical rules of inference for the propositional calculus. 
Introduction of in succedent. in antecedent. 


D 

& 

V 


A, r-*0,B 
r-*0, adb. 
r-* 0 , a r-* 0 , b 

r -*•©, a & b. 
r-*©, a r-*©, b 

r -*> 0, A V B. r -*• 0, A V B. 
a, r-»0 
r-*©,iA, 

with 0 empty for 
the intuitionistic system. 


A-*A, A 

b, r-»0 

A DB, A, 

r-»A, 0 . 

A, r-*© 

b, r-*0 

A&B, r-»0. 

a&b, r-»0. 

a, r-^0 

b, r-»0 


A V B, T 0. 
T 0, A 
-i A, T-»0. 


Additional logical rules of inference 
for the predicate calculus. 

Introduction of in succedent. in antecedent. 

r-*0,A(b) A(t), T -*• 0 

r 0, VxA(x), VxA(x), r —»■ 0. 

subject to the 
restriction on variables. 


V 
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3 


r-» 0 , A(t) 

r 0, 3xA(x). 


A(b), r -» 0 
3xA(x), r -*■ 0, 
subject to the 
restriction on variables. 


Thinning 


Structural rules of inference, 
in succedent. 


in antecedent. 


T -»0 
r-*©; C, 

with 0 empty for the 
intuitionistic system. 


r-*@ 

c, r-*©. 


Contraction 


r -> 0, c, c 
r-*©,c. 


c, c, r-*© 
c, r-*0. 


Interchange 


r A, C, D, 0 
r —► A, D, C, 0. 


Cut. 



A, D, C, T -^0 
A, C, D, r —► 0 . 
c, r-»© 

A, 0. 


For the classical system Gl, the postulates except the two 3-rules 
fall into a dual-symmetric arrangement, thus. The rules for & and V 
are dual to each other, the one set being transformed into the other by 
the interchange of & with V and —► with Similarly, the V- and 3-rules 
are dual. The axiom schema, the -i-rules, and the structural rules of the 
four kinds, are each self-dual. 

The rules in the left column we call succedent rules', and we denote 
them briefly by “-*3”, &”, V”, ‘W-i”, V”, 3”, T ”, 

“-*• C”, ► J”, respectively. The rules in the right column we call 

antecedent rules, and denote by " 3 “& etc. 

The logical rules constitute introductions of a logical symbol, but 
sometimes in the succedent (left column), and sometimes in the anteced¬ 
ent (right column). The formula in which the logical symbol is introduced 
is called the principal formula ', and the one or two formulas shown ex¬ 
plicitly in the premise (s) the side formula(s). 


Example 1. The uppermost rule in the right column is “3-intro- 
duction in the antecedent”, or ‘‘the 3-antecedent rule”, or briefly 
“ 3 —The principal formula is A 3 B, the first premise has A as side 
formula, and the second B. 
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The logical rules of Gl are more or less similar in form to the respective 
derived rules of Theorem 2 § 23, with the undefined formal symbol -*• 
appearing now in place of the defined metamathematical symbol “ 

An introduction in the succedent corresponds to an introduction in 
Theorem 2, and in the antecedent to an elimination in Theorem 2. 
The present axiom schema and structural rules correspond to general 
properties of the former f- as listed in Lemma 5 § 20. 

The tree form (end § 24) is used in the construction of proofs in G 1; 
and "b S”, where S is a sequent, is used to express that the sequent S 
is provable. 

In exhibiting proofs (or parts thereof), it is tedious to show separately 
all the applications of the one-premise structural rules. We shall adopt 
the convention that a double line (with or without citation of another 
rule) stands for a sequence of zero or more thinnings ("T”), contractions 
("C”) and interchanges (“I”) (following the application of the other 
rule when another is cited). 

Example 2. For any formulas A, B and C, (a) and (b) are proofs 
in G 1 intuitionistically, (c) only classically. 


B B C-> C . 

A —^ A B 3 C, B —► C ^ 

A -*■ A B, A D (B D C), A-*C ^ 
A, A D (B D C), A D B -*C ^ 

AD(BDC), A z> B A D C 
A D B (A D (B D C)) D (A 3 C) ~ 
(A D B) D ((A D (B D C)) D (A D C)). 


A-* A 

A, -i A B 
-iA A D B “ 
-iA3(AdB). 


D 


A, “i A 
iA -»• A 
i-i A D A. 


From the postulate list for Gl, we verify by induction: 

Lemma 32a. If T B 1( ..., B m in the intuitionistic system Gl, 
then m = 0 or m = 1. 

Lemma 32b. If (- T-^Bj, ...,B m in the intuitionistic system Gl 
without using the rule -i —then m — 1. 
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Lemma 32a expresses the whole difference between the intuitionistic 
and the classical system Gl; but it is useful to observe that we needed to 
restrict only two of the postulates for the intuitionistic system G 1 to 
secure this difference. 

Our first objective is to show (as did Gentzen) that system G 1 is equivalent 
to our former system H, in the sense that, for any formula E, |—► E 
in Gl, if and only if j- E in H. This result will be contained in the 
next two theorems for F empty. 

Theorem 46. If T (- E in H with all variables held constant, then 
j- F —> E in G \. When the given deduction in H uses the -postulates only for 
certain of the symbols D, &, V, -i, V, 3, then the resulting proof in Gl uses 
the logical rules only for D and for the same symbols. 

Proof, by course-of-values induction on the length of the given de¬ 
duction in H of E from T. Sixteen cases arise, as follows, according to the 
analysis for E in the given deduction. 

Case 0: E is one of the formulas T. Then the following is a proof 
in Gl of T -*■ E. 

E-> E 
T-*E. 

The other cases will be numbered as the respective postulates la, lb, 
2, 3, 4a, 4b, 5a, 5b, 6, 7, 8 or 8 1 , 9 — 12 of H. 

Case lb: E is an axiom by Axiom Schema lb, i.e. E is 
(AdB) 3 ((A 3 (B 3 C)) D (A D C)) for some formulas A, B and C. 
Writing T before the —*■ in the endsequent of Example 2 (a), and doubling 
the line over it (to represent thinnings), makes that tree into a proof in 
GlofT-^E. 


Case 2: E is an immediate consequence of two preceding formulas 
by Rule 2, i.e. for some pair of formulas A and B, E is B and the two 
preceding formulas are A and A D B. By the hypothesis of the in¬ 
duction, there are proofs in Gl of T-* A and F-* A D B. Grafting these 
two proofs onto the following tree, we obtain a proof of B. 


B -»B 


ADB AdB, A-^B 


A, T^B 


Cut 


B. 


Cut 


Case 8 or 8 1 . See Example 2 (c) or (b), respectively. 
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Case 9: E is an immediate consequence of a preceding formula 
by Rule 9. Then E is C 3 VxA(x), and the preceding formula is C 3 A(x), 
where C does not contain x free. Let I\ be the subsequence of T comprising 
those of the formulas T on which that preceding formula C 3 A(x) 
depends in the given deduction of E from T. By omitting from the given 
deduction all formulas below the formula C 3 A(x) and those above 
which depend on other assumption formulas than I\, we obtain a de¬ 
duction of C 3 A(x) from I\. Applying the hypothesis of the induction 
to this deduction, there is a proof in Gl of C 3 A(x). Since the varia¬ 
bles are held constant in the given deduction of C 3 VxA(x) from T, 
none of the formulas contains free the variable x of the application 
of Rule 9. This and the fact that C does not contain x free are used in 
verifying that the restriction on variables is satisfied for the -*V in 
the following. 


A(x) —*• A(x) 


I\-»C 3 A(x) C 3 A(x), C -»■ A(x) 

C,r 1 -»A(x) ~~ 

C, T 1 -^VxA(x) 
r-^CD VxA(x). 


Cut 


Case 12: E is an immediate consequence of a preceding formula 
by Rule 12. Dual to Case 9. 


We introduce the notation to be used in Theorem 47. Let F be some 
particular closed formula. Say that 0 is Bj, ..., B m (m > 0). Then let 
0' be Bj, ..., B m-1 (empty if m <, 1), 0” be B m if m ^ 1 and -i (F 3 F) 
if m = 0, -i 0 be -i B x , ..., -i B m , and -i 0' be -i B x , ..., ~iB m _ 1 (empty 
if m <, 1). 


Corollary 1 . If T, -i0' |- 0” in H with all variables held constant, 
and provided for the intuitionistic systems that ra < 1, then h T —* 0 
in Gl. 


Use cases according as w = 1, m = 0 or m > 1. 


Corollary 2. For the classical {intuitionistic ) systems when 7, w ^ 1 
(7 ^ 1, m = 1): // f- Aj & ... & A[ D Bj V ... V B m in H, then 
f- Aj, ..., A t Bj, ..., B m in G 1. 

Theorem 47. If h 0 in Gl, then T, -i0' 0” in H with all 

variables held constant. (In particular: If 1- T—► E in Gl, then r h E 
in H with all variables held constant.) 
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Moreover, for the intuitionistic (classical ) systems, when the given -proof 
in G\ uses the logical rules only for certain of the symbols D, &, V, -i, V, 3, 
then the resulting deduction in H uses only the 'D-postulates (the D- and 
—i-postulates) and the postulates for the same symbols, provided in case the 
symbols include V but not & the V -postulates include Axiom Schema 9a 
of Lemma 11 § 24. 

Proof. We prove the first statement of the theorem by course-of- 
values induction on the height (i.e. number of levels) of the given proof 
in G\ of T-* ©, using cases according to the postulate of Gl applied last 
in this proof. 

The additional details given in the second part of the theorem will 
be verified after we have gone through the cases. In all but the cases 
and subcases indicated by a single or double star, the demonstration of 
the existence of the resulting deduction is almost immediate by use of 
general properties of b and, in the case of a logical rule of Gl, an ap¬ 
plication of the corresponding derived rule (other than a -i -rule) of 
Theorem 2 § 23, or in the case of -► V and 3 -*■ of the corresponding strong 
rule of Lemma 10 § 24. In the cases marked by a single star, we use also 
the intuitionistic -i-rules (-i-introd. and weak -i-elim. §23) and some¬ 
times * 1 § 26. In the cases marked by a double star, we further use the 
classical (strong) -i -elimination rule (Theorem 2). None of the nine cases 
not mentioned below requires either subcases or starring. 

Case 1: the axiom schema. By general properties of h C b C in H. 

Case 2: -*• D. We have by the hypothesis of the induction that 
A, T, “i © b B in H with all variables held constant, and we must infer 
that T, -i © b A D B likewise. This we can do by D-introd. 

Case 3: D Subcase 1: A empty or © not empty. Then (A, ©)" 
is 0". By hyp. ind., A, nA b A and B, T, -i0' b 0". Thence, using 
D-elim., A D B, A, I\ -iA, -i©' b ©". Subcase 2**: A not empty and 
0 empty. By hyp. ind., A, -iA b A and B, T |- t(Fd F). By D-elim., 
A D B, A, T, -iA |- -i(FdF). Using *1, A D B, A, T, -iA b F D F. 
Hence, by -i-introd. and -i-elim., A D B, A, r,-tA' b -i“iA" b A". 

Case 10*: ->-i. Subcase 1: 0 empty. From A, T b ~i(F D F), 
we infer T b “iA by *1 and -i-introd. Subcase 2: 0 not empty. From 
A, T, -i0' b 0", we infer T, -i0', -i0'' b “>A by -i-introd. 

Case 11: -i Subcase 1*: ©empty. Use weak-i-elim. Subcase 2**: 
0 not empty. Use -i-introd. and -i-elim. 
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Case 12: —»-V. By hyp. ind., T, -i0 b A(b) with all variables held 

constant. By strong V-introd., A(b) b b VxA(x). Hence F, -i 0 (- VxA(x), 
with all variables including b held constant, since by the restriction on 
variables for -»V, T and 0 do not contain b free. 

Case 16: any one-premise structural rule. Let the premise be 0, 
and the conclusion be rl -*■ 01. Then rl (0 + ) comes from T (0) by 
permuting formulas, suppressing repetitions of formulas, or introducing 
new formulas. Subcase 1: ©Ms 0. From r, -. 0' |- 0", we infer rl, 
~i0' |- 0" by general properties of h Subcase 2*: 01 is not 0, and 0 is 
empty. By hyp. ind., V b -i(F D F). Thence by general properties of b> 
rt,-i0t' p -i(FdF). Thence by *1 and weak -i-elim., d, -i01' 
b 01". Subcase 3**: 01 is not 0, and 0 is not empty. Then 01 is not 
empty, and at least one of 0 and 01 consists of more than one formula. 
By hyp. ind., T,-i 0' b 0". By weak -i-elim., T, -i0', -i0" h i(F 3F), 
i.e. r, ->0 |- -i(F D F). Thence by general properties of b d, ->0l 
b -i(FDF), i.e. rl,-i 01',-i 01" b ~* (F 3 F). Thence by *1, -i-introd. 
and -.-elim., rl, -.01' b 01". 

Case 17: Cut. Treated as Case 3, except without using 3-elim. 
Subcase 1: A empty or 0 not empty. Subcase 2**: A not empty and 
0 empty. 

To verify the second statement of the theorem, first suppose that 
the given proof in G 1 of r~» 0 where 0 is B x , ..., B m is intuitionistic 
and does not use the -.-rules. Then using Lemma 32b, we see that none 
of the singly or doubly starred cases or subcases can occur. The statement 
then follows from the manner of treatment of the unstarred cases, by 
Lemma 11 § 24. 

If the proof does use the —i -rules but is intuitionistic, the statement 
permits the intuitionistic -i -postulates of H to be used in the resulting 
deduction, and we need only verify, using Lemma 32a, that none of the 
doubly starred cases or subcases can occur. 

Corollary. When l, m >. 1 (and hence for the intuitionistic systems, 
m — 1): If b A x , .. A[ -*■ B x , ..., B m in G 1, then 
b A x & ... & A, D B x V ... V B m in H. 

§ 78. Gentzen’s normal form theorem. Example 1. The proof (a) 
contains a cut, while (b) is a proof without cut of the same 
sequent. 
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ZZ{b) -*■ (b) ^ 3x<C7(x) -» 3xc^7(x) 

Zi(b) -*■ 3x<£7(x) 3x<C?(x), -i3x<3(x) -+ 

Zi{b), -i3x<C7(x) -*■ 

-i3x<^f(x) 

-t3x<^?(x) —*Vx -tcZ(x) 

—> -i3x<^7(x) 15 Vx“t<3(x). 



(b) 


«af(£) -*Zi{b) 
d(b) —► 3x<CT(x) 
csHjj), -i3xc^7(x) —*• 

-i 3x<^7(x) -► “i <£?(£) 
-i3x<£?(x) -» Vx ~i <C?(x) ~ 
-*-i3x«£?(x) D Vx-»<C2(x). 


The first proof uses an unnecessarily complicated formula 3x^7(x) in 
the antecedents of the right branch. This complication is unravelled 
by means of the cut. The second proof proceeds directly, without in¬ 
troducing complications that are subsequently unravelled. 

The significance of proofs without cut, as in a sense proofs in normal 
form, is further emphasized by the subformula property (Lemma 33a 
below). 

We define ‘subformula' of a given formula thus. 

1. If A is a formula, A is a subformula of A. 2 — 4. If A and B are 
formulas, the subformulas of A and the subformulas of B are subformulas 
of A D B, A & B and A V B. 5. If A is a formula, the subformulas of 
A are subformulas of -iA. 6 — 7. If x is a variable, A(x) is a formula, 
and t is a term free for x in A(x), the subformulas of A(t) are subformulas 
of VxA(x) and 3xA(x). 8. A formula has only the subformulas required 
by 1—7. 

Example 2. The subformulas of .2 3(ic3DS) are the five 
formulas <C7 3 (-i<£7 D B), <^?, -ic2f 3 B, -\d, C B. 

Example 3. (a) The subformulas of V/>Vc(B(c) & <£?(£)) are the 

formulas V^Vc(B(c) & a(b)), Vc(B(c) & <Sf(t)), B{u) & cT(t), 8(u), c'T(t), 
for every term t not containing c free and every term u. (b) The only 
subformula of <C7(c) is <C7(c). 

From the postulate list for G 1, we verify by induction: 
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Lemma 33a. Each formula occurring in any sequent of a proof in G 1 
without a cut is a subformula of some formula occurring in the endsequent. 
(Subformula property.) 

Lemma 33b. Each formula occurring in the antecedent (succedent) 
of any sequent of a proof in (71 without a cut or application of the D- or 
-i -rules is a subformula of some formula occurring in the antecedent ( suc¬ 
cedent) of the endsequent. 

Gentzen’s “Hauptsatz” or normal form theorem (Theorem 48 below) 
asserts that the cuts can always be eliminated from the proof of any 
sequent in which no variable both occurs free and occurs bound. 

The restriction that no variable occur both free and bound in the end- 
sequent does not detract from the usefulness of the theorem. For when 
variables do occur both free and bound in a given sequent, by replacing 
the formulas by others congruent to them we can obtain a sequent 
satisfying the restriction and provable if and only if the given one is 
provable (by Theorem 47, Lemma 15b § 33, *18a and *18b § 26, and Corol¬ 
lary 1 Theorem 46). The restriction is necessary, as we illustrate now. 

Example 4. Consider the proof 

-> < 3 ( 6 ) 

3(c) & <3(A)-* <3(A) ^ ^ 

Vc(B(c) & d{b)) c 1(b) v “* 

VAVc(B(c) & «3(£)) -* <77(A) V <3(c)-*c3(c) v 

VAVc(B(c) & c 71(b)) -> VbGEb) VA<^(A) <3(c) ^ 

VAVc(3(c) & cT(A)) a{c). U 

The cut cannot be eliminated. For by Lemma 33a, no sequent in a proof 
without cut of VAVc(3(c) &<7?(A))-* cT(c) can contain the symbols D 
and -i; so the D- and -i -rules cannot be used. Hence Lemma 33b applies. 
But the two lists of subformulas in Example 3 have no formula in com¬ 
mon, so that no axiom satisfies the requirement of Lemma 33b. (The 
method of this example will be developed further in § 80.) 

In proving the normal form theorem, we use a formal system (72 
obtained from (71 by changing two of the postulates, as follows. 

The cut is replaced by the following rule, called “mix”. Here M is 
a formula (the mix formula ); II, O, £, £2 are sequences of zero or more 
formulas such that both O and £ contain M; and 0 M and S M are the 
results of suppressing all occurrences of M in O and £, respectively. 
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Example 5. 


n -*■ o 

n, s M 


o 



Q 


Mix 


The following is a mix. 

SVC, S,Z), S-» 

a, s v c, t> 


Mix 


The Dof G1 (distinguished as D —^j) is replaced by a new one 
(D -* 2 ), due for the classical system to Ketonen 1944. Here A, B, T, 0 
are as above; and 0° is 0 for the classical system and empty for the 
intuitionistic. 


T-> 0°, A B, r-*0 
ADB,r-4 0. 




Any cut can be accomplished by a mix (below left), and vice versa 
(right), with the help of TCI steps. 

A-*A,C C,r-» 0 „. II-*.® 

- - ■■■■« ' ■ » " ■ ■ « —■ — P-T f » " — , f « , ■■■ 

A, Tq Ap, 0 II ► ®jj, M M, Sjj —► Q 

A, r-*A, 0 . n, s M — <d m , q. u 

Similarly, any O can be accomplished by a D -* 8 (below), noting 
that for the intuitionistic case A is empty by Lemma 32a; and vice 
versa (left to the reader). 

A-*A, A B, r-*0 

A, T A, 0°, A B, A, T-* A, 0 

A D B, A, T-* A, 0. ° ~* 2 


This shows that the two changes do not alter the class of the provable 
sequents, as we state with additional details in Lemma 34. 

A proof in Gl or in G2 is said to have the pure variable property, or 
to be a pure variable proof, if no variable both occurs free and occurs 
bound in the proof, and for each application of —► V or 3 —► the variable 
b of the application occurs only in sequents above the conclusion of the 
application (where if the A(x) does not contain the x free, we choose 
for the analysis a b occurring only thus). 

Lemma 34. If (- T 0 in G 1, then |- T —► 0 in 62; ami conversely. 
The proof in G2 contains a mix, only if the proof in Gl contains a cut; 
and conversely. Either proof contains applications of exactly the same-named 
logical rules as the other. If either proof has the pure variable property, so 
does the other. Lemmas 32a—33b (stated above for Gl ami the cut) hold 
good also for G2 and the mix. 
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Lemma 35. A given postulate application of G 1 or G2 remains an 
application of the same postulate when, in the sequent(s ) of the application 
(namely, the axiom in the case of the axiom schema, and the premise(s ) and 
conclusion in the case of a rule of inference), a variable is changed in exactly 
its free occurrences (or in exactly its bound occurrences) to another variable not 
occurring either free or bound in the sequent(s) in question. 

Lemma 35, in the case of an application of -» V or 3 —>, depends on 
the fact that the b and the x do not need to be the same variable (as 
they did for the corresponding rules 9 and 12 of the system H). 

Lemma 36. A given postulate application of Gl or G2 remains an 
application of the same postulate when, in the sequent(s) of the application, 
a given term is substituted for (the free occurrences of) a given variable, 
provided in the case of an-*- V or 3 -» that (i) the term does not contain the 
b of the application and (ii) the variable is not the b of the application, 
and also provided in every case that (iii) the term is free for the variable in 
each formula of the sequent(s). 

Condition (i) insures that the restriction on variables remains satisfied 
for an -» V or 3 -±. 

Lemma 37. Given a proof in G 1 or G2 of a sequent in which no variable 
both occurs free and occurs bound, by changes in free and bound occurrences 
of variables in sequents of the proof (each postulate application remaining an 
application of the same postulate), we can obtain a proof of the same sequent 
in which proof no variable both occurs free and occurs bound. 

Proof. Let x 1( ..., x„ be the distinct variables which occur free 
in the endsequent and occur bound (elsewhere) in the proof, and 
a 1( ..., a„ be the other distinct variables which occur both free and bound 
in the proof. Let y 1( ..., y„, b 1 , .. .,b p be distinct variables not occurring 
in the proof. Change x x , ... ,x„ in their bound occurrences only toy!, .. . ,y„, 
respectively, and a x , ..a p in their free occurences only to b lt ..bp. 
By Lemma 35, the figure remains a proof; and by the hypothesis that the 
endsequent contains no variable both free and bound, the endsequent is 
unchanged. 

Lemma 38'. Given a proof in Gl or G2 in which no variable occurs 
both free and bound, by changes only in free occurrences of variables in it 
(each postulate application remaining an application of the same postulate), 
we can obtain a pure variable proof of the same sequent. 
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Proof. Suppose that in the given proof in G\ or G2 there are exactly 
q applications of the rules —► V and 3 —► with respective variables c lt ... ,c Q 
(not necessarily distinct) as the b’s of the applications. Select any q 
distinct variables d lt ..., d„ not occurring in the given proof. Choose 
one of the applications which is uppermost (i.e. has no other over it), 
say its b is c x . Substitute d x for c x throughout all sequents above the con¬ 
clusion of the application, but nowhere else. Under the restriction on 
variables for —► V and 3 —c x cannot occur free in the conclusion of the 
application, so using Lemma 35 all postulate applications remain valid. 
Repeat the procedure, each time working on one of the applications of 

V or 3 -*■ which is uppermost amongst those not yet treated, until 
the b’s of all q of the applications have been changed from c’s to d’s. 
Because each substitution alters only sequents above the conclusion of 
an application of -»• V or 3 -*•, the endsequent is unaltered. 

Theorem 48. Given a proof in G 1 {in G2) of a sequent in which no 
variable occurs both free and bound, another proof in G1 {in G2) of the same 
sequent can be found which contains no cut {no mix). This proof is a pure 
variable proof. The only logical rules applied in it are ones which were 
applied in the given proof. (Gentzen’s Hauptsatz or normal form theorem, 
1934-5) 

Proof, reducing the theorem to a lemma. By Lemmas 34, 37 and 38, 
it suffices to prove the theorem for G2 assuming the given proof already 
to have the pure variable property. We do so by induction on the number 
m of mixes in this ‘given proof’. If m > 0 , there must occur in it a mix 
which has no other mix over it. Consider the part of the given proof 
which terminates with the conclusion IT, 0 M , Q of this mix; call 

it the ‘given part’. Suppose that we can transform this given part so as 
to obtain another proof in G2 of II, S M —> 0 M , II without mix; call it the 
‘resulting part’. Then the replacement of the given part by the resulting 
part in the given proof gives us a new proof in G2 of the same sequent 
with only m— 1 mixes. Suppose further that the resulting part can be 
constructed so that it has the pure variable property by itself, and also 
contains no variable free (bound; as the b of an V or 3 -*■) that did not 
so occur in the given part. Then the new proof as a whole will have the 
pure variable property. Hence we can apply the hypothesis of the in¬ 
duction to conclude that there is a pure variable proof with no mix. To 
prove the theorem it thus remains to establish the following lemma. 

Example 1 (continued). The given part for (a) (restated with a 
mix instead of a cut) is as follows. 
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a(b) a{b) 

!%(b) Ix^jx) 


3xc2?(x) —*■ 3x<3?(x) 
n3xc2f(x), 3xL?(x) —) 


Mix 


c 31(b), -i3x<37(x) 


Lemma 39. Given a proof in G2 of n, S M —*• <D M , Q with a mix as 
the final step, and no other mix, and with the pure variable property, a 
proof in G2 of II, S M — > 0 M , Q can be found with no mix, with the pure 
variable property, and with no variable occurring free (bound ; as the b of an 

V or 3 —>) which did not so occur in the given proof. Only logical rules 
are applied in the resulting proof which are applied in the given proof. 
(Principal lemma.) 

Proof of the principal lemma. We define the left rank a of a mix 
as the greatest number of sequents, located consecutively one above 
another at the bottom of any branch terminating with the left premise of 
the mix, which contain the mix formula M in the succedent. The right 
rank b is defined similarly. The rank r = a-\-b. (The least possible rank 
is 2.) The grade g of the mix is the number (> 0) of occurrences of logical 
symbols (D, &, V, -i, V, 3) in the mix formula M. 

Example 1 (concluded). The left rank is 1 , the right rank 2, the 
rank 3, and the grade 1. 


The lemma is proved by course-of-values induction on the grade g 
of the mix. Within both the basis and the induction step of this induction, 
a course-of-values induction is used on the rank r. We give a treatment 
by cases, so that by drawing upon the results of the cases the bases and 
induction steps of the inductions can all be carried through. 

We write the mix to be eliminated thus, 


n-+o s-*o 
n, s M <d m , a, 

where M £ O and MSS. 


or briefly 



S 2 


The letters “A”, "B”, “C”, “D”, ‘T”, "0”, "x”, “A(x)”, “t”, “b” will 
refer to the statements of the other postulates in question. 


A. Preliminary cases. 


Case la: S x has M in the antecedent, i.e. M 8 IT. Then the con¬ 
clusion If, S M -*• ® M , of the mix comes from its second premise S —> Q 
by TCI, and is hence provable without mix. The intuitionistic restriction 
on the —> T’s can be satisfied, as the TCI steps terminate with the 
original endsequent, which by Lemma 32a can have intuitionistically 
at most one formula in its succedent. 
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Case 2a: the left rank is 1, and S x is by a structural rule. Since 
M 8 <J>, but not to the succedent of the premise for the inference of S x , 
the inference can only be a T with M as the C. Thus the bottom of the 
given proof is as at the left below with M t 0. We alter this to the figure 
at the right, to obtain a proof of the original endsequent without mix. 


0 


0, M 


■T 


Q 


r, » 0 , o. 


Mix 


r -»0 

r, s M ->0, a 


Cases lb, 2b: similarly, reading "S 2 ”, "succedent”, “Q”, "right” in 
place of “S/’, “antecedent”, "11”, "left”, respectively. Treated sym¬ 
metrically to Cases la, 2a. 

B. Other cases. For each of these cases, it is part of the case hy¬ 
pothesis that none of the four preliminary cases applies. 


Bl: the rank is 2. Since Cases la, lb are excluded, S x and S 2 must 
both be conclusions of inferences. Since the rank is 2 and Cases 2a and 2b 
are excluded, both inferences must be logical. Moreover, since the rank 
is only 2, M must be the principal formula of both inferences. Therefore 
M contains a logical symbol, and the mix is of grade > 1. Thus these 
cases can arise only under the induction step for the induction on grade, 
and the hypothesis of the induction on the grade is available in treating 
them. The rules used in inferring S x and S 2 can only be, respectively, the 
rule introducing the outermost logical symbol of M in succedent, and the 
rule introducing the same symbol in antecedent. Thus under Bl we 
have only the following six cases. 


Case 3: S x is by -*■ "D , and S 2 by 13 with M as the principal 
formula A D B. Then the bottom of the given proof is as follows, where 
ADB g 0, f since the rank is only 2. 


a, n 0 , b ^ 

n 0, A D B D 

— 


r-*a°,A B,r-*a 


a => b, r 

0, a 


a 


Mix 


D -* 


We alter this to the following. 

r q°, a 


A, B 0, B 


r, if 


Q° a , 0, B 


Mix 


b, r-*a 


r n r —n° 0 o 

1 ’ li A’ 1 B AB’ U B> ” 

n, r —> 0 , q. 


Mix 
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The upper mix is of grade lower than the original, so by the hypothesis 
of the induction on the grade, it can be eliminated, i.e. a proof without 
mix of its conclusion can be found. Then the lower mix will have no mix 
over it, and can likewise be eliminated. Thus we obtain a proof without 
mix of the original endsequent II, T-> 0, LI. 

Case 4: S x is by &, and S 2 by &-»■, with M as the principal 
formula A & B. 

Case 5: S x is by —► V, and S 2 by V with M as the principal 
formula A V B. Treatment is dual to that of Case 4. 

Case 6: S 2 is by -> -i, and S 2 by -i with M as the principal 
formula -iA. 


Case 7: Sj is by ->V, and S 2 by V->, with M as the principal 
formula VxA(x). We have as the given figure, where VxA(x) $ 0, T, 


n -» Q, A(b) A(t), r-»Q 

II -» 0, VxA(x) VxA(x), r —^ Q 

-——-Mix 


In constructing the altered figure, we shall need a proof of II -*• 0, A(t). 
If A(x) does not contain x free, then A(b) and A(t) are the same formula, 
and we already have it. Suppose then that A(x) does contain x free. Then 
A(t) contains t, and so, since t is free for x in A(x), the variables of t occur 
free in A(t). Now we can conclude from the pure variable property of the 
given proof of II, T -*■ 0, Q that (i) no variable of t is the b of an V 
or 3 in the proof of II -*• 0, A(b) in the left branch of our figure, 
(ii) our b is not the b of an —> V or 3 in the same, and (iii) t is 
free for b in all formulas of the same (since the pure variable property 
allows no variable to occur both free and bound). Moreover, in view 
of the restriction on variables for the exhibited -*V, (iv) b does not 
occur in II, 0. Therefore, using Lemma 36, by substituting t for b 
throughout the proof of II —► 0, A(b) we obtain a proof of II — > 0, A(t). 
When this proof is used in the altered figure as follows, the new proof of 
II, T-> 0, Q thus obtained will have the pure variable property and 
will contain free (bound; as the b of an V or 3 —»-) no variable not 
previously so occurring. 


n-»0,A(t) 


A(t), r-> Q 


n,r 


A(t)" 


®A(t)> ^ 


Mix 


n, r->©, a. 
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Case 8 : S x is by -»3, and S 2 by 3 with M as the principal 
formula 3xA(x). Dual to Case 7. 

B2: the rank is > 2. These cases can only occur in the induction step 
of an induction on the rank (either within the basis, or the induction 
step, of the induction on the grade). Hence the hypothesis of the in¬ 
duction on rank is available in treating them. 

B2. 1 : the left rank is > 2. Then M occurs in the succedent of at 
least one of the premises for the inference of S x . 

Case 9a: S x is by a succedent structural rule S with M not the C 
and not the D, or S x is by an antecedent structural rule S. Let the given 
figure be as shown at the left. We alter this as shown at the right (ex¬ 
planation follows). 


Hi-*- $1 ^ 

^->0, 2 Mix 

n -> o £2 

M1Y 

n x , s m — > ^ 

n, —> 3> m , q. 

Hi* ► £2, O XM ^ 


n. — ► £2. 


n, s M — > <j> m , o. 

Because M £ $ x (by B2. 1 ), we can 

take ITi -*■ O x as first premise for 


the new mix. Then from the case hypothesis and the form of the one- 
premise structural rules, we verify that the new S is correct. In the altered 
figure, the rank of the mix is one less than in the original. Hence by the 
hypothesis of the induction on the rank, we can find a proof of its con¬ 
clusion, and hence of the original endsequent, without mix. 

Case 10 a: S x is by a succedent structural rule S with M as the C or 
the D. Let the premise be n -*■ O x . From the form of the succedent 
structural rules, we verify that then 0 1M and 0 M are identical. 

_ L 5 _ 1 _Mix 

n -»Q Mix n > 2 m ”► 

n, s m -*■ 0 M , £2. 

Case 11 a: S x is by a one-premise logical rule L. The inference by any 
one of these rules which gives S x has the form 

A x , r —> @, A 2 

e, r-»© s. 

where each of A x , A 2 is either a side formula or empty, and one of E 1( B a 
is the principal formula while the other is empty. 
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Subcase 1 : S 2 is not M. Then M 8 0, since M 8 0, E 2 . We write the 


given figure thus. 

Aj, r —^ 0 , a 2 

Sj, r —» 0, o 2 


L 


a 


Mix 


l 2 , l> F, ®M> “ 2 - 

We alter to this (explanation follows). 


Aj, r —> 0, A 2 


a 


Ai, r, ► ©j,t> a 2M , 


Mix 


Ai, I\ S M 0 M , Q, A 2 


Si, T, S- 


M 


© M , Q >H *2 


L 


iij, r, S M > ®m> ^2) 


The new L has T, S M as its T and @ M , 0 as its 0. The conclusion of the 
new L is the original endsequent, except for the order of formulas within 
the succedent. This enables us to infer that, if L is —^ V or 3 —the 
restriction on variables is satisfied for the new application, since by the 
pure variable property of the given proof, the b cannot occur in the 
original endsequent. If the given proof is intuitionistic, then (by Lemma 
32a) 0, A 2 consists of at most one formula. But M8 0; hence A 2 is 
empty. Therefore the TCI steps preceding the new L require no violation 
of the intuitionistic restriction on —► T. (The possibility that the new L 
is a -*■ -i violating the intuitionistic restriction is ruled out a priori, by 
comparison of its conclusion with the given endsequent; but in fact L 
cannot be -»• -i.) The new mix is of rank one less than the original. 


Subcase 2 : S 2 is M. Then E x is empty, and we omit it in writing the 
given figure thus. 

Aj, T —► 0, A 2 t 


0, M 

r, s 


M 


@ M , IT 


Mix 


Since E 2 is M, then A 2 is not M (and hence M 8 0, since M 8 0, A 2 by B2.1). 
Because Case lb is excluded, M $ Q. We use these facts in writing the 
mixes in the altered figure thus. 


a x , r 


0, A 2 


o 


a x , r, 
Ai, r, s 


M 


0ji> A 2 , o 

®M> A 2 


Mix 


r, s M © M , q,m 


Q 


r, s M , © M , q, o 


Mix 


r, —*■ © M , it 
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If L is -*■ V or 3 the restriction on variables is satisfied because 
T, 2 M , 0 M , Q, M all occur in the given pure variable proof elsewhere 
than above the conclusion of the original L. (But L cannot be 3 —*.) 
This subcase cannot occur intuitionistically, since the sequent T —v 0 , M 
occurring in the given proof has more than one M in its succedent. Now 
the upper mix in the altered figure is of rank one lower than the original 
mix; so by the hypothesis of the induction on rank, it can be eliminated. 
Then the lower mix will have no mix over it. Since M ££!, A 2 , the left 
rank will be only 1 , while the right rank will be the same as in the original 
mix. Since by hypothesis the left rank was originally ;> 2, the rank of 
this mix is less than that of the original; and hence by the hypothesis of 
the induction on rank, it too can be eliminated. 

Case 12a: S x is by a two-premise logical rule L, except the in- 
tuitionistic 3 The inference which gives S x may be written 

An, F 0 , A 12 A 21 , ► 0 , A 2 2 £ 

Si r_ 0 Ho 

b —i. T' W , l—<2' 

Treatment is similar to Case 11 a. Both premises are mixed with 2 -*■ Q 
(noting that M £ 0), prior to the L in the altered figure. 

Case 13: S 2 is by the intuitionistic 3 Since M £ 0, and 0 is 
at most one formula, 0 is M. Only the second premise B, T -*• M is 
mixed with 2 -*■ Q in the altered figure. 

B 2 . 2 . Cases 9b— 12 b. Statement and treatment symmetric to 
Cases 9a—12a, except as follows. The intuitionistic verifications for 
Case lib are immediate. (Subcase 2 can arise intuitionistically.) The 
treatment of the intuitionistic 3 —► can be included in Case 12b by 
writing 0° in the first premise; the verification that the new 3 -*■ has 
an empty sequent for its 0 ° is obtained by observing that intuitionis¬ 
tically the O of the given mix is simply M. 

The following application of Theorem 48 was given for the intui¬ 
tionistic system by Curry 1939 . 

Theorem 49. If a formula E is provable in the intuitionistic ( classical ) 
system H, it is provable using only the "D-postulates {the 3- and -1 -postulates) 
and the postulates for the logical symbols occurring in the formula, provided 
that in case V occurs but not & the 'i-postulates include Axiom Schema 
9a of Lemma 11 § 24. 

Proof. Using Lemma 33a, no logical rules can be applied in a proof 
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in G 1 without cut, except rules for logical symbols occurring in the 
endsequent. The theorem follows by Theorems 46 (using the first part 
only), 48 and 47 (both parts), if no variable occurs in E both free and 
bound; otherwise, use also Remark 1 (c) § 33. 

Curry’s book 1950 (not available during the writing of the present 
§§ 77—80) contains contributions to the theory of Gentzen-type systems, 
and a full bibliography. Curry uses the name “elimination theorem” for 
the Hauptsatz. Our name “normal form theorem” is intended to suggest 
the intent of the theorem, when it is merely explained, without giving 
details of Gentzen’s system, that the normal form is one for proofs; 
and we have kept “elimination theorem” for cases when a symbol or 
notation is being eliminated as in § 74. (But “normal form” has the similar 
disadvantage that it commonly refers to one for formulas as in §§29, 76.) 

The next two sections, in which two applications of the theorem are 
presented, may be read independently of each other. 

*§ 79. Consistency proofs. For Gl or G2, we call an application 
of one of the structural rules a structural inference ; of one of the logical 
rules of the propositional calculus, a propositional inference ; of one of 
the additional logical rules of the predicate calculus, a predicate inference. 

Under special assumptions as to the form of the endsequent, a still 
further normalization of proofs car. be achieved, as in the following 
“extended Hauptsatz” given by Gentzen 1934-5 for his classical 
system. (It is closely related to Herbrand’s theorem, 1930 .) As Gentzen 
remarked, the proof is an illustration of the possibilities for permuting 
inferences in his systems. These possibilities are discussed further in 
Curry 1952 for the classical system of his 1950 , and in Kleene 1952 
for the classical and intuitionistic systems described here. For example: 

Theorem 50. Given a sequent containing only prenex formulas and 
in which no variable occurs both free and bound, and given a proof of this 
sequent in the classical system Gl (the intuitionistic system Gl without 
V —*■), another proof in the same system of the same sequent can be found 
which contains no cut and in which there is an (occurrence of) a sequent S 
(called the midsequent) such that no quantifiers occur in S, and the part of 
the proof from S to the endsequent consists solely of predicate and structural 
inferences. The new proof is a pure variable proof. Similarly reading “G2 ”, 
“mix” for “Gl”, “cut”, respectively. 

Proof, reducing the theorem to a lemma. We suppose Theorem 48 
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already applied, so that we are dealing with a given pure variable proof 
in G1 without cut (or G2 without mix). By the subformula property, only 
prenex formulas can occur in sequents of this proof. 

Consider any axiom in the given proof containing quantifiers, say 
that V comes first. We can replace the axiom (left) by the following 
figure (right), 

VxA(x) -*• VxA(x), A(b) -*■ A(b) 

VxA(x) —► A(b) ^ ~^Y 
VxA(x) -*■ VxA(x), 

where b is a variable not previously occurring in the proof. The formula 
A(b) of the new axiom has one less quantifier than the formula VxA(x) 
of the original axiom. Similarly, if 3 comes first. By induction on the sum 
of the numbers of quantifiers in the formulas of all the axioms, we can 
thus eliminate axioms containing quantifiers altogether from the given 
proof (retaining its other mentioned features). 

Now suppose, as we state in the following lemma, that we can rear¬ 
range the logical inferences in this proof (retaining its other features) 
so that each predicate inference follows all the propositional inferences. 
Then there will be (an occurrence of) a sequent S 2 below which the proof 
contains only predicate and structural inferences (with no branchings, 
as the rules considered are all one premise rules) and above which only 
propositional and structural inferences. There may be occurrences of 
formulas with quantifiers in S 2 and above. However none of them is as 
side formula for a propositional inference, since the resulting principal 
formula would not be prenex. None of them is as principal formula, 
since there are no predicate inferences in this part of the proof. None of 
them is in an axiom, as previously arranged. Hence each postulate 
application in the part of the proof down to S 2 inclusive will remain 
correct, if we alter this part of the proof by suppressing every occurrence 
in it of a formula containing a quantifier, except that some of the 
structural inferences may thereby become identical inferences (with 
premise and conclusion the same) and can be omitted. This alteration 
will replace S 2 by a sequent S containing no quantifiers, but from S we 
can pass to S 2 by zero or more T and I steps, and thence by the unaltered 
part of the proof using only predicate and structural inferences to the 
original endsequent. Thus we obtain the new proof as described in the 
theorem, with S as the midsequent. 

Lemma 40. Given a pure variable proof in the classical system G 1 (the 
intuitionistic system G\ without V->) without cut of a sequent containing 



462 


CONSISTENCY 


CH. XV 


only prenex formulas, another such proof of the same sequent can be found in 
which each predicate inference follows every propositional inference. The 
same sequents occur as axioms in the new proof as in the given proof. Simi¬ 
larly reading “G2”, “mix” for “G 1 ”, “cut”, respectively. 

Proof of the lemma. Consider any predicate inference in the given 
proof, and count the number of propositional inferences occurring in 
the branch leading from its conclusion down to the endsequent of the 
entire proof. The sum of these numbers, for all the predicate inferences 
in the proof, we call the order of the proof. We establish the lemma by 
induction on the order. 

If the order is not 0 , there is some predicate inference such that a 
propositional inference occurs below it with no intervening logical 
inference. 


Case 1 : the predicate inference is an —* V, and the first propositional 
inference below it is a one-premise inference L. As noted above, using the 
subformula property, the VxA(x) of the —* V cannot be the side formu¬ 
la of the inference L. Hence the given figure is as shown at the left. Classi¬ 
cally we alter this to the figure shown at the right. 


Pt-» Q x , A(b) 

r x -* @ 1( VxA(x) 

r 2 -»o, vgg7o7 L 

r 3 -> <D, VxA(x), @ 3 . 


r x -» Q lt A(b) 

A(b), VxA(x) 
r 2 -*<D,A(b),VxA(x), 0 2 
r 3 ^d>,VxA(x), Q„A(b) 
r 3 -* <D, VxA(x), 0 3 . 


The -* T is to take care of the possibility that the TCI steps in the 
given figure include a -* C with VxA(x) as the C. The pure variable 
property of the given proof ensures that the restriction on variables is 
satisfied for the —* V in the altered figure. Intuitionistically 0 1( <D, 0 2 
and 0 3 are all empty, and then we omit the —► T in constructing the 
altered figure. The alteration decreases the order of the proof by 1 . 


Case 2: -* V followed by a two-premise propositional inference 
L. Classically the treatment has only to be changed to show an extra 
premise (either left or right) above the L, preceded in the altered figure 
(except when L is by a —*7\ Intuitionistically L can only be 

ID-* with the extra premise on the left. 

Cases 3 — 8 . The classical and intuitionistic treatment of the cases 
for V —* followed by either a one- or a two-premise propositional 
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inference is essentially symmetric to the classical treatment for —> V. 
The four cases for -» 3 and 3 —*■ follow dually. 

Example 1. Theorem 50 does not hold for the intuitionistic system 
G 1 with V —For consider the following proof. 

a{d) 57(a) d[b) 57(/>) 

57(a) -*> 3x57(;c) 57(£) -+ 3x<^(x) ^ ^ 

57^ V a{b) -+ lx^(x). 

Suppose there were another proof as described in the theorem. Then 
the midsequent S would be of the form II -»• O where II consists of zero 
or more occurrences of <57 (a) V <57 (b) and O either is empty or is < 57 (t) 
for some term t, since reading upward from the endsequent to S, the 
TCI steps (if any) can only suppress, duplicate or permute formulas 
but not introduce new formulas, and the predicate inferences (if any) 
can only be an -*• 3 which (reading upward) changes 3*57(.r) to < 57 (t). 
But then if II or O were empty, II — > ® could not be provable in G1 
without cut, by Lemmas 33 a and 33b used as in Example 4 § 78. So consider 
e.g. the case II — *■ O is 57(a) V <57(/>) —57(t). The term t may be a or b 
or neither. Consider e.g. the case t is a. Then by Corollary Theorem 47, 
57(a) V 57 (b) 3 57(a) would be provable in the propositional calculus H\ 
and by Theorem 4 § 25, so would be 57 V B 3 57, contradicting Theorem 9 
§ 28. 

The consistency theorem. Gentzen illustrated the use of his 
extended Hauptsatz in establishing consistency results such as had 
previously been obtained by Ackermann, von Neumann and Herbrand. 
Bernays 1936 and Hilbert and Bernays 1939 stated a general consistency 
theorem for axiomatic theories, which they based on the Ackermann 
treatment. We state such a theorem now. 

The theorem can be used to infer metamathematically the consistency 
of an axiomatic theory from number theory, geometry or algebra, 
when a model for the theory, i.e. more precisely for its notions and 
axioms (§ 14), can be established constructively, i.e. in finitary terms. 

The theory is to be formalized as a formal system, the terms and 
formulas of which are constructed using the logical symbolism of the 
predicate calculus H with certain individual symbols e 1( ..., e 9 , function 
symbols f v ..i T and predicate symbols P 1( ..., P s . The system may 
have a finite or infinite number of axioms in addition to the postulates 
of the predicate calculus H. 

To establish a constructive model, we must start with a domain D of 
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objects which is either finite (and non-empty) or enumerably infinite. 
We could take D to be the natural numbers N in every case, by selecting 
some fixed effective enumeration of it when it is not already (with rep¬ 
etitions if it is finite), and thereafter dealing with the indices in this 
enumeration in place of the original objects. But in practice it is con¬ 
venient to deal with other domains directly. The case of a finite domain 
admits a simpler treatment from Theorem 20 § 36 extended to include 
individual and function symbols in the valuation procedure. 

In establishing a model, the next step is to interpret the individual, 
function and predicate symbols in the domain D, i.e. to chose objects 
e v ..e a from D as values of e x , ..., e 4 , functions / x , ..., f r with 
independent variables ranging over D and values in D as values of 
f x , ..., f„ and logical functions or predicates P v ..., P s with independent 
variables ranging over D as values of P x , ..., P s . For the model to be 
constructive, / x , ..., f r must be effectively calculable functions and 
P lt ..., P s effectively decidable predicates. We say in this case that 
e v .... e Q , f v ..., f r , P v ..., P, is an effective interpretation of e x , ..., e„, 
f x , .... f r , P x , ..., P s in D. (When D is N, by Church’s thesis §§ 60, 62 
we can expect f v .... f r , P v ..., P s to be general recursive, in which case 
we have a general recursive interpretation. However the consistency 
theory does not need to be connected with Church’s thesis, since we 
merely need to recognize at each application of the consistency theorem 
that the particular f lt .... f r , P v ..., P s which we use are effective.) 

Given an effective interpretation, and using the 2-valued truth tables 
for D, &, V, -i (§ 28), we have a valuation procedure by which, given 
any formula A(x x , ..., x„) containing no quantifiers and only the distinct 
variables x x , ..., x n , we can, for each w-tuple x v of objects from 

D as values of x 1( ..., x„, effectively determine the value of A(x x , ..., x n ) 
to be t (true) or f (false). (When D is N, by # #A, C, D § 45, the predicate 
A (x v ...,%„) which the formula thus expresses is primitive recursive 
in / x , ..., f r , P x > ..., P s .) 

For N as the domain D, instead of speaking of the value of A(x x , ..., x n ) 
when x x , ..., x n take the natural numbers x lt ...,%„ as values, it is 
usually more convenient to speak of the value of A(x x , ..., x n ) (where 
x x , ...,x n are the numerals for the natural numbers x lt ...,x n , re¬ 
spectively). When we do this, A(x x , ..., x„) is a formula in the symbolism 
extended if'necessary to include 0 as an individual symbol and ' as a 
function symbol, and the interpretation is extended to give 0 and ' their 
usual values. We can suppose the original symbols not to have included 
0 or ', unless in the original interpretation it receives the usual value. 
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We can do similarly when D is not N. There must be provided a particular 
variable-less term x (the analog of a numeral) for each object x oi D) 
and the symbolism and interpretation are extended (if necessary) to 
include these terms (consistently with the original interpretation). In 
both cases, we shall now say simply “number” for member # of D, 
and “numeral” for the variable-less term x which expresses x. The 
extension of the symbolism (when necessary) is to assist in describing the 
valuation process, and need not apply to the formal system under 
consideration, except when we say so. 

In the model, i.e. after selecting the domain and the interpretation 
of the non-logical constants, the axioms must be true. As the axioms 
will (in general) contain variables, we need a more general sense of ‘true’ 
than that given by the valuation procedure. We shall formulate it here 
only for prenex formulas (Theorem 19 § 35). Consider first a closed 
prenex formula, e.g. for illustration 3y 1 Vx 1 3y 2 Vx 2 3y 3 A(y 1 , x 1( y 2> x 2 , y 3 ) 
where A(y 1( x x , y 2 , x 2 , y 3 ) contains no quantifiers and only the distinct 
variables shown; call this formula “G”. Under the interpretation of 
e x , ..., e„ f 1( .... f r , P 1( ..., P s and the usual meanings of the quanti¬ 
fiers, G is true if and only if there is a number y v such that for each 
number x v there is a number y 2 depending on x x (write it “y 2 (x x )”), 
such that for each number x 2 , there is a number y 3 depending on x 1 and 
x 2 (write it “y 3 (x 1( x 2 )”), such that 

(I) A(y x , x x , y a (xj), x 2 , y 3 (x v x 2 )) 

(where y^xj is the numeral for the number y 2 (x x ), etc.) has the value t. 
For metamathematical purposes, moreover we wish the existence to be 
understood constructively, i.e. to mean that the y v the y 2 (x x ) (for any 
given x x ) and the y 3 {x v x 2 ) (for any given x v x 2 ) can be found. This 
implies that y 2 (x x ) and y 3 (x v x 2 ) are effectively calculable functions. We 
say then that G is effectively true, if there are a number y x and effectively 
calculable functions y 2 (x x ) and y 3 (x v x 2 ) such that for every x x and x 2 , 
(I) is t. (When D is N, by Church’s thesis we can expect y 2 (# x ) and 
y 3 (x i, x 2 ) to be general recursive, in which case we say that G is general 
recursively true.) An open prenex formula shall be effectively ( general 
recursively) true , if its closure is. The name verifiable is given to a formula 
without quantifiers, when for each substitution of numerals for its 
variables the resulting formula takes the value t by the valuation pro¬ 
cedure, i.e. in the present terminology, to an effectively true formula 
without quantifiers. 

Actually our requirement that y 2 (x x ) and y 3 (x v x 2 ) be effectively 
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calculable only emphasizes what would in any case be implied in the 
constructive use of the existential quantifier; and in other respects also 
(left implicit) the hypothesis of the theorem must be satisfied in a con¬ 
structive way to conclude consistency metamathematically, e.g. the 
number y 1 and the description of the functions y 2 (x 1 ) and y 3 (x v x 2 ) must 
be given effectively, and the demonstration that (I) is t for all x 1 and x 2 
must be by finitary reasoning (cf. the remarks on the converse of Church’s 
thesis § 62). 

A prenex formula in which no V-quantifier follows an 3-quantifier we 
shall call an V3 -prenex formula. 

Theorem 51. Let the terms and formulas be constructed using the 
logical symbolism of the predicate calculus H with e 1 , ..., e a , f lt ..., f r , 
P x , ..., P, as non-logical constants. For any given domain D and effective 
evaluation of these constants in D: 

(a) If T are effectively true closed prenex formulas, E is a closed 
V3- prenex formula, and T 1- E in the predicate calculus H, then E is 
effectively true. 

(b) For any formal system S, the postulates of which are those of the 
predicate calculus H and axioms each of which is (or is equivalent in H to) 
an effectively true prenex formula : If E is an V3- prenex formula, and 
b E in S, then E is effectively true. In particular: S is simply consistent. 

Proof of (b) from (a). {b E in 5} {T b E in H, where T is 
some finite list of non-logical axioms of S}-> {r x b E x in H, where r x 
are the closures of the effectively true prenex equivalents of T, which 
are likewise effectively true, and E x is the closure of E} -> {E x is ef¬ 
fectively true} (by (a) with r x , E x as its T, E) -> (E is effectively true}. 
In particular, 1=0 (in the case of the number-theoretic symbolism with 
the usual interpretation), or A & -i A where A contains no quantifiers, is 
not effectively true, and hence is unprovable in 5. So 5 is simply con¬ 
sistent, by the second form of the definition in § 28. 

Proof of (a). If it is in the intuitionistic H that T b E is given, 
then a fortiori T b E in the classical H, since the intuitionistic postulate 
8 1 is provable classically (§ 23). 

Since F, E are closed, all variables are held constant in the deduction 
F b E, and no variable occurs both free and bound in the sequent 
r -*■ E. So by Theorems 46 and 50 there is in the classical predicate 
calculus G 1 a proof of T —► E with the features described in Theorem 50. 
Let h be the level at which the midsequent occurs. 
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Continuing the illustration, say that F is the single formula G above 
and E is Vv^v^wB^, v 2 , w) where B(v 1 , v 2 , w) contains no quantifiers 
and only the distinct variables shown. 

Select any pair v v v 2 of numbers. 

Now we shall carry out a series of acts, one to each predicate inference 
in this G1 proof, beginning with the lowest and working upward. Each 
act will consist in substituting a certain numeral (specified below) for 
every occurrence throughout the tree of each variable occurring free in 
the premise of the inference. When the act is performed on an inference 
by —> V or 3 the inference is of course destroyed. The given proof is 
a pure variable proof, so the b of an application of V or 3 —> does not 
occur below its premise. Hence by Lemma 36, at the stage (call it stage g) 
when the acts have been performed for the inferences the premises of 
which are at or below level g (< h ), each of the original postulate applica¬ 
tions the premises of which are above level g will have been transformed 
into an application of the same postulate, but in the system G 1 with 
the symbolism extended (if necessary) to include the numerals. 

We now specify the choice of the numerals for the substitutions, at 
the same time proving by induction on g that, if g < h , then at the g-th 
stage all sequents up to level g inclusive have the following property 
(call it P) : the antecedents (succedents) contain only formulas of the 
forms shown in the left (right) column 
3y 1 Vx 1 3y 2 Vx 2 3y 8 A(yi, x„ y 2 , x 2 , y 3 ) Vv^v^wB^, v 2 , w) 

Vxi3y 2 Vx 2 3y 3 A(y 1 , x v y 2 , x 2 , y 3 ) Vv 2 3wB(u 1( v 2 , w) 

3y 2 Vx 2 3y 3 A(yi, t x , y 2 , x 2 , y 3 ) 3wB(u 1( v 2 , w) 

Vx 2 3 Y 3 A ( 3 ; i. tj, y 2 (t 1 ), x 2 , y 3 ) B(® lf v 2 , s) 

3 Y3 A (3 ; i> tj, y 2 (h)> y 3 ) 

A (^i> ti> y^i)> ^2> yai^v O) 

where t 1( t 2 and s are variable-less terms, t x and t 2 are the numbers 
expressed under the given effective interpretation by t x and t 2 , respective¬ 
ly, and y v y 2 (^i)> v i an< i v i are the numerals for the numbers 

Yi> yS l). yS\> V 1 and v 2 respectively, where y v y 2 (Xj) and y 3 (x v x 2 ) 
are the number and effectively calculable functions given by the hy¬ 
pothesis that G is effectively true. 

Basis : g = 1 . Only the endsequent is at or below level g, and it 
has property P. 

Ind. step: g> 1. For g> h the induction proposition holds vac¬ 
uously. For g < h by the hypothesis of the induction, as the tree figure 
now stands, the sequents below level g have property P. We distinguish 
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cases according to the kind of inference which applies from level g to 
level g —1. 

Case 1: —V. In our illustration, the principal formula must have 
one of two forms, e.g. (using the second) Vv a 3wB(i; 1 , v 2 , w). The side 
formula is then 3wB(u 1 , b, w) where b is the b of the original —*• V. We 
substitute v 2 for b wherever it occurs, i.e. only in the side formula of 
the premise (at level g) and in sequents above level g. By this act, property 
P is established at level g, since the side formula becomes 3wB(u 1 , v 2 , w) 
which is one of the allowed succedent forms for property P, and all the 
other formulas in the sequent are duplicates of respective formulas in 
the conclusion, which by hyp. ind. were already of allowed forms. 

Case 2: V-*-. In our illustration, the principal formula must be 
of one of two forms, e.g. (taking the second) Vx^yjA^j, t x , y 2 (t i), x 2 , y 3 ); 
and the side formula is then of the form 3y 3 A(_y 1 , t 1( y 2 {t x ), t*, y 3 ) where 
t* is a term resulting by previous substitutions from the t of the original 
inference. By the hyp. ind., no variable occurs free in sequents below 
level g, and so only the variables of t* occur free in the premise. We 
substitute 0 for each of these (if any), if the domain D is N, and other¬ 
wise some specified numeral, wherever they occur, i.e. only in the side 
formula (at level g) and above level g. The side formula becomes 
3y 3 A(yi, tj,^).^, y 3 ) where t 2 is the result of this substitution on 
t*, and the other formulas at level g are unchanged and were already of 
allowed forms for property P. 

Case 3: —>-3. Similar to Case 2 (with one possibility for the form 
of the principal formula in our illustration). 

Case 4: 3 In our illustration the principal formula may have 
one of three forms, e.g. 3y a Vx 2 3y 8 A(y 1 , t x , y 2 , x 2 , y 3 ); and the side 
formula is then of the form Vx 2 3y 3 A(y x , t x , b, x 2 , y 3 ) with the original b. 
We substitute the numeral y 2 (t x ) for b. 

Case 5: T, C or I. The premise already has property P, because 
it contains no formula not occurring in the conclusion; and no act is 
performed to change the situation. 

This completes the specification of the acts, and the proof that at 
stage g (<, h) sequents up to level g inclusive have property P. So at 
stage h, when the whole alteration of the given proof in the classical 
predicate calculus G\ has been completed, the original midsequent, 
since it contained no quantifiers, has become of the form 
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A(yi, tjj, y 2 {t ij), t 12 , ^ 3 (^ 11 , ^ 12 ))) • • • > A(y 1( t ?] , y 2 (tn), t £2 , y 3 {tj\> ^ 2 )) 

-*• B(« lf u 2 , Si), ..., B(©j, v 2 , s m ). 

The tree figure down to the midsequent has become a proof of this 
sequent in the propositional calculus G\ with numerals added to the 
symbolism. So by Corollary Theorem 47, 

A(,yi. tii, yzttn), t 12 , y 3 {t lv ^ 12 )) & • • • & A(v x , tjj, y 2 {t n ), t i2 , y 3 {t n , t t2 )) 

3 v 2 , Si) V ... V B(iq, v 2 , s m j 

is provable in the propositional calculus H, and hence by Theorem 9 
§ 28 (and Theorem 4 § 25) is identically true when its distinct prime 
parts are treated as distinct proposition letters for the valuation procedure 
of the propositional calculus. In particular, it takes the value t when 
we assign to the distinct prime parts (which contain no variables) the 
values t or f which they take under the given effective interpretation of 
the non-logical constants. By hypothesis, (I) takes the value t for every 
pair x v x 2 of numbers. Hence by the valuation tables for &, D and V, 
one of v 2 , Sj), ..., B(tq, V 2 , s m ) must be t. Let the first which 

is be B(v lf v 2 , s a ). The variable-less term s a then expresses a number s a 
such that B(i>j, v 2 , s a ) is t. 

The whole process by which, after choosing the numbers v 1 ,v 2> we 
obtain this number s a is effective; so s a = w(v v v 2 ), where w(v lt v 2 ) is 
an effectively calculable function. Then for every v v v 2 , 

(II) B(iq, v 2 , w(v v v 2 )) 

is t, i.e. E is effectively true, as was to be shown. 

Remark 1 . The above construction enables us to say something about 
how the function w(v v v 2 ) is related to the functions and predicates 
f v ..., f r , P lt ..., P s , y 2 (x 1 ), y 3 (x v x 2 ), irrespective of the nature of the 
latter. Thus, with or without the hypotheses of effectiveness in the theorem, 
we have when D is N: (a) w is primitive recursive in f lt ..., f r , P lt ..., P s , 
y 2 , y 3 . For we can show by induction on g that, if g < h, at stage g 
each term t occurring free in the tree and containing exactly p 
distinct variables expresses, as v v v 2 vary, a function t(v v v 2 , u lt . . ., ti v ) 
explicit in f v .. ,,/ r , y 2 , y 3 and constants. Hence (for i = 1 , ..., m) s £ 
expresses a function s,(zq, v 2 ) explicit in the same; and each prime 
part of B(tq, v 2 , s £ ) expresses a predicate explicit in the same and 
Pi, ..., P s , whence using # # A, C, D § 45 B^, v 2 , s,) expresses a predicate 
B t (v 1 , v 2 ) primitive recursive in f u ..., j T , P lt . .., P s , y 2 , y 3 . Thence (a) 
follows by applying #F with B t as the and s { as the (no cp m+1 being 
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required). More specifically: (b) w is explicit in f 1 , ..., f r , y 2 , y 3 , -j-, •, sg, 

the representing functions of P v ..., P„ and constants (# # 1, 2, 9). 

Applications. Theorem 52. For the primitive recursive predicate 
R(x, y), formal system S, and formulas A(x) (x = 0, 1,2, ...) described in 
Example 2 § 73: A(x) is provable in S, only if ( Ey)R(x , y). (In fact: Every 
'i'^-prenex formula provable in S is effectively true under the intended 
interpretation.) 

Proof. The axioms of S are all verifiable (hence effectively true), 
when the domain is the natural numbers and the non-logical constants 
0, ', fj, ..., 4, = are interpreted in the intended way (which is effective), 
i.e. by 0, ', <p x , ...,<p fc , =, respectively. So (b) applies. Using it with 
A(x), i.e. 3yf ft (x, y)=0, as the E: If b A(x) in S, then A(x) is effectively 
true under this interpretation, i.e. there is a y for which f k (x,y) —0 is t 
(i.e. for which cp k (x, y) = 0), i.e. (Ey)R(x, y). 

Theorem 53. (a) Robinson’s system (Lemma 18b § 49), call it S, is 

simply consistent (and every V3~prenex formula provable in S is effectively 
true), (b) In S, for a given primitive recursive predicate R(x,y), and a 
formula R(x, y) which numeralwise expresses it obtained by the method 
of proof of Corollary Theorem 27 §49: b 3yR(x, y) only if (Ey)R(x,y). 
(c) In S, if the formulas A (a, b) and B (a, c) of Example 1 § 61 are obtained 
by the method of proof of Corollary Theorem 27, then (conversely to (52)): 
{bB (x)}^(Ey)W 0 (x,y). 

Proof. It will suffice to prove the theorem for the classical systems, 
as 1 =0 or 3yR(x, y) or B(x) is provable in the intuitionistic system only 
if provable in the classical. 

(a) Each of the thirteen axioms except *137 (or *136) is verifiable 
under the usual (effective) interpretation. Employing *90 § 35, we 
obtain 3b(a=0V a=b’) as an V3-prenex equivalent of *137. This is 
effectively true, with b(a) = a— 1, since then for each natural number a, 
a—0 V a=(b(a))' is t. 

(b) The consistency theorem does not apply immediately, because 
3yR(x, y) is not an V3-prenex formula. However we shall show (following 
the lemma) that we can add new axioms to S containing new predicate 
symbols, and extend the interpretation effectively to the latter, in such 
a way that each of the new axioms is equivalent in the predicate calculus 
to an effectively true prenex formula, and b S(x, y) ~ R(x, y) in the 
resulting system S' where S(x, y) contains no quantifiers and expresses 
R(x, y) under the interpretation. Then (b) will follow thus: 
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{b 3yR(x,y) in S} {b 3yR(x, y) in S'}-*{b 3yS(x, y) in S'} 

-* (Ey)R( x >y)- 

Lemma 41°. Let A(x, y) be a formula containing no quantifiers and 
only the distinct variables shown, suppose given an effective interpretation 
of its non-logical constants, and let A (x, y) be the predicate which it expresses 
under this interpretation. Let A(x) be a new predicate letter, (a) Suppose 
that 

( 1 ) (Ey)A (x, y) -* (Ey) y S „ (X) A (x, y) 
where x>(x) is an effectively calculable function. Let 

(2) A(x) m (Ey)A(x, y). 

Let A(x) be interpreted by A{x) (which by ( 1) and (2) is effective). Then the 
formula 

(i) A(x) ~ 3yA(x, y) 

is equivalent in the predicate calculus to an effectively true prenex formula. 
Similarly with "x v ..., x„, y v ..., y m ” in place of “x, y”. (b) Similarly 
with universal instead of existential quantifiers (and “ in place of 
in (1)). 

Proof of Lemma 41. (a) Unabbreviating ~ in (i), and using *96, 

*89, *97 and *91, we obtain as an V3-prenex equivalent 

(ii) Vy3z[{A(x) D A(x, z)} & {A(x, y) D A(x)}]. 

This formula is effectively true with z(x, y) = \lz z <^ {x) A(x, z). 

(b) Similarly using *95, *89, *98 and *91, we obtain 
Vy3z[{A(x) D A(x, y)} & {A(x, z) D A(x)}], 
which is effectively true with z(x, y) = \lz z <^ (x) A(x, z). 

Proof of Theorem 53 (b) (concluded). By the proof of Corollary 
Theorem 27 § 49 from Theorem 27, it will suffice now to show, for the 
proof of Theorem 27, that S can be extended so that b P( x i» • • •, x »> w) ~ 
P(x x , ..., x n , w) with P(x 1 , ..., x n , w) interpreted by y(x v ..., x„)—w. 

Case (Vb). By hyp. ind., S is already extended to include a predicate 
symbol Q(x 2 , ..., x„, w) interpreted by ty(x 2 , ..., x n )—w such that 
b Q(x 2 , ..., x n , w) ~ Q(x 2 , ..., x n , w), and t R(y, z, x 2 , ..., x n , w) in¬ 
terpreted by x(y, z,x 2 , ..., x n )=w such that b. R(y, z, x 2 , • • •, x «, w) ~ 
R(y, z, x 2 , ..., x B , w). We introduce (if we have not already) a<:b, 
to be interpreted by a<b, with the axiom a<b ~a<b, i.e. a<ib ~ 
3c(c'-bd=/>), using c <, b as the bound y ^ u(x 1( x 2 ) for Lemma 41. We 
further extend S (if we have not already) to include a predicate symbol 
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B(c, d, i, w), to be interpreted by (3(c, d, i)—w, such that b B(c, d, i, w) 
~B(c, d, i, w); referring to the definition of B(c, d, i, w) accompanying 
*(180) § 41, this can be done by several steps (left to the reader) similar 
to those we perform next. Now w r e introduce D(c, d, i, x 2 , .... x n ), in¬ 
terpreted by (Eu)(Ev)[$(c, d, i')=u & $(c, d, i)=v & y(i, v, x 2 , .. ., x n )=u], 
with the axiom D(c, d, i, x 2 , ..., x n ) ~ 3u3v[B(c, d, i', u) & B(c, d, i, v) & 
R(i, v, x 2 , . . ., x„, u)], using the bounds u < (3(c, d, i'), v<$(c,d,i). 
Next we introduce E(c, d, y, x 2 , ...,x n ), interpreted by 
{i)[i<y -> (Eu)(Ev)[$(c, d, i')=u & 1 3(c, d, i) = v & y(i, v, x 2 , ..x n )=u]], 
with the axiom E(c, d, y, x 2 , .... x„) ~ Vi[i<y D D(c, d, i, x 2 , ..x n )], 
using the bound i <> y. Then we introduce F to eliminate the first 3u 
of P(y, x 2 , ..., x„, w), using u <, (3(c, d, 0); and finally P to eliminate the 
3c3d, using as bounds c <, C and d < D with C and D as described in 
Example 1 (A) § 57 when rj(y, x 2 , ..x n ) = <p(y, x 2 , ..., x n ). By Theorem 
I Case (Vb) (C) § 49, the predicate P(y, x 2l ..., x n , w) assigned to interpret 
P(y, x 2 , ..., x n , w) is indeed <p (y, x 2 , ..x n )=w ; and by the equivalences 
introduced as axioms or already established, 

b P(y, x 2 , ..., x„, w) ~ P(y, x 2 , ..., x„, w). 

Proof of Theorem 53 (c). Similarly, with a quantifier elimination 
of Vc following those for Theorem 27. 

Theorem 53 (a) gives the consistency property required for the first 
proof of Theorem 54 in § 76 (and Theorems 53 (b) and 52 for the proofs 
in Remark 2 §76). 

Theorem 55. The formal number-theoretic system of Chapter IV, 
under the restriction on the induction schema that in the A(x) the x should 
not occur free within the scope of a quantifier, is simply consistent. (Acker- 
mann 1924 - 5 , von Neumann 1927 ; cf. Hilbert and Bernays 1939 pp. 121 , 
122 and 127.) 

Moreover: Every V3-prenex formula provable in it is effectively true, 
and this remains the case upon adjoining to it Robinson’s axioms (Lemma 
18b) and the axioms used in the proof of Parts (b) and (c) of Theorem 53, 
which parts apply to this system also. 

Proof, reducing the theorem to a lemma. It suffices to prove the 
theorem for the classical system. We shall first treat the case the A(x) 
is restricted to contain no quantifier at all. Then Lemma 42 will complete 
the proof. 

Consider an axiom by Postulate 13 in this system, say its A(x) is 
A(x, x 2 , ..., x„) with exactly the distinct variables shown and no 
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quantifiers. Let A (x, x 1 , ..., x„) be the (primitive recursive) predicate 
expressed by A(x, x x , ..., x n ) under the usual interpretation. Using 
*89 and *98, the axiom is equivalent to 

3y[A(0, x lf ..x n ) & (A(y, x v ..x n ) D A(y', x v ..x n )) 

3 A(x, x v ..., x„)]. 

This formula is effectively true, with 

vlT X = f 0 if A ( X > X V • ■ •. *n) V A(0, x v . . .,X n ), 

T ’ v \ \Ly ySx A (y, x v ..., x n ) & A (y', x v ..., x„) otherwise. 

Lemma 42°. The class of the provable formulas {or of those deducible 
from given assumption formulas) in the system of Theorem 55 is not di¬ 
minished by further restricting the A(x) for the induction schema to contain 
no quantifier at all. 

Proof. Any formula A(x) not containing x free within the scope 
of a quantifier must be composed by operations of the propositional 
calculus from formulas A x , ..., A mi , each of which contains no quantifiers 
but may contain x, and formulas A mi+1 , ..., A mi+ „, 2 , each of which 
contains quantifiers but does not contain x free (m lf m 2 ;> 0; m — 
m 1 J r m 2 ;> l).. By Theorem 11 §29 on principal disjunctive normal form 
(with Theorem 3 § 25), A(x) is equivalent to a disjunction of formulas 
A iX & ... & A im (i — 1, ...,«) where each A i; . is either A } . or —i A^ 
depending on i, if the first case of the normal form applies. For the 
moment suppose m v m 2 ^ 1. Write “B,(x)” for A^ & ... & A imi , 
"Cfor A !>mi+1 & ... & A im , and “B(x)” for some refutable formula 
without quantifiers. Then using *48 and *34 § 27, A(x) is equivalent to 

(a) B(x) V (Bj(x) & Cj) V ... V (B„(x) & C„). 

Now consider any formula of the form (a) with n ^ 0 where (as above) 
the B’s contain no quantifiers and the C’s contain quantifiers but no 
free x’s; call n its degree. We prove by induction on the degree that any 
axiom by Postulate 13 with A(x) of this form is provable using Postulate 
13 only with A(x)’s containing no quantifiers. 

Ind. step: n > 0. Write (a) as 

(b) D(x) V (B„(x) & C„). 

The axiom under consideration is then 

[D(0) V (B„(0) & C„)] & Vx[D(x) V (B n (x) & C„) D D(x') V (B„(x') & C n )] 

(c) D D(x) V (B„(x) & C„). 

We shall show that (c) is deducible in the predicate calculus from the 
two following axioms, 
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(d) [B„(0) V D(0)] & Vx[B n (x) V D(x) D B„(x') V D(x')] D B„(x) V D(x), 

(e) D( 0 ) & Vx[D(x) D D(x')] D D(x); 

i.e. we are to show: (d), (e) b (c). By Theorem 14 §33 with *45 and 
*34 § 27: C n b (c) ~ (d); and with *47 and *48: -iC„ b (c) — (e). No 
variables are varied since C n does not contain free the variable x of the 
quantifier Vx within which replacements are performed. So, using 
V-elim.: (d), (e), C„ V -iC„ b (c); and hence using *51: (d), (e) b (c). 

But in (e) the A(x) is of the form (a) with degree n — 1, and likewise in 
(d), when B n (x) V B(x) is taken as a new B(x). 

When m x = 0, the method applies construing "B,(x)'’ and “B,(x) &” as 
the empty expression, with the following differences. Consider 

(f) C„ & Vx[C„ 3 C„] DC, 

Using *46: C„ b (c) ~ (f); and using *48: -iC„ b (c) ~ (e). But using 
*1 §26, *75 §35, *45, and *1 again: b‘ (f)- 

When w 2 = 0 , the A(x) already lacks quantifiers. 

If the second case of the normal form applies, then using *53, *34 and 
*53, A(x) is equivalent to B(x)&-iB(x). 

Remark 2 . In the systems of Theorems 52, 53 and 55, every provable 
V3-prenex formula is primitive recursively true. For R(x, y) and R(x, y) 
as in Theorem 53 (b), {b 3yR(x, y)} -> {there is a primitive recursive 
function 9 such that (x)R(x, 9 (#))}. Proofs. By Remark 1 , and our 
constructions of the functions and predicates for the effective inter¬ 
pretation and effective truth in Theorems 52, 53 and 55. 

Example 2. Let S be the number-theoretic formal system with 
omitted. What the adaptation of Presburger 1930 cited at the beginning 
of § 42 gives directly is an effective correlation, to each closed formula A 
of S, of another B, with properties ( 1 ) — (4) as follows. (To consider an 
open formula, let A be its closure.) ( 1 ) b A ~ B in S. ( 2 ) B is either 
true or false, under an obvious extension of the valuation procedure 
(preceding Theorem 51). Now say A is true (false), if B is. (This special 
definition of truth for closed formulas of S is equivalent to the one called 
for under the general definition of truth which will be taken up at the end of 
§ 81, by (3).) (3) Truth and falsity in this sense obey the 2 -valued truth 
tables.; {3xA(x) is true} = (£x){A(x) is true}; and {VxA(x) is true} = 
(x){A(x) is true}. (4) According as B is true or false, B is provable or 
refutable in 5. Thus: (5) S is simply complete, and complete with respect 
to the interpretation (§§41, 29). ( 6 ) There is a decision procedure for the 
question whether A is true. (7) If 5 is simply consistent, there is a de- 
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cision procedure for S, i.e. for the question whether \- A in S. Now we 
show: ( 8 ) If b A in S, then A is true; so S is simply consistent. Suppose 
b A in S. Then for some finite collection T of non-logical axioms of S, 
T b A in H. We can use Lemma 41 to eliminate the quantifiers from 
A and from the induction formula A(x) of each induction axiom among T, 
taking e.g. for a part 3yA(x, y) 

u(%) = ;xy[({HyA(x, y) is false} & y=0) V A(x, y)]. 

We thus obtain P, 0 b A' where 0 are the formulas (i) of Lemma 41 
with 3 or V for the predicate symbols introduced in the elimination. 
Then A' (and hence A) is true, by Theorem 55 (or the first part of its 
proof) for the system with 0 adjoined as axioms. 

Discussion. In these applications of the consistency theorem, the 
domain D is the natural numbers, the numerals are already a part of 
the symbolism, and the effective interpretation is the intended one. 
Hilbert and Bernays 1939 pp. 38 — 48 give two applications of their 
consistency theorem to geometrical axiom systems. There certain enumer¬ 
able systems of complex numbers are used as the domains D. 

These consistency proofs all depend on having a model for the axioms, 
as did those given before the advent of Hilbert’s proof theory (cf. § 14). 
But giving a model for the axioms in intuitive arithmetical terms does 
not establish beyond all doubt that no contradiction can arise in the 
theory deduced from the axioms, unless it can also be demonstrated 
that the reasonings in the theory can be translated into intuitive arithmet¬ 
ical reasonings in terms of the objects used in the model. This demon¬ 
stration is what the present consistency theorem (Theorem 51) adds to 
the earlier treatment. (Cf. Bernays 1936 pp. 115—116 and Hilbert- 
Bernays 1939 p. 48.) 

We did take the deduction of the theory from its axioms into account 
in the discussion of consistency in the first part of § 75, but there we were 
using non-finitary set-theoretic methods. 

The consistency theorem depends, via the extended Hauptsatz (Theo¬ 
rem 50), on the Hauptsatz (Theorem 48) and the reduction of H to G\ 
(Theorem 46). By these, given a proof of a formal theorem in a system 
based on the predicate calculus with mathematical axioms, it is possible 
to alter the system and proof until we have a proof which contains no 
formulas more complicated than the axioms and theorem themselves, 
i.e. only subformulas of them. In this situation there is no excursion 
through "ideal” statements in proving a "real” theorem from "real” 
axioms (§ 14). 
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Our metamathematical consistency result of Theorem 55 for a part 
of number theory of course holds good upon extending the system 
by further effectively true prenex axioms, e.g. one could add symbols 
for further primitive recursive functions with their recursion equations. 
These were eliminable in the full system (Example 9 §74), but presum¬ 
ably are not in general in the system with the restricted induction 
schema. 

One may also consider some new induction postulates, which would 
hold as derived rules in the full system, but presumably do not when 
Postulate 13 is restricted as for Theorem 55, and which are susceptible 
of like treatment under corresponding restrictions. Examples of such are 
given in Hilbert and Bernays 1934 pp. 343 — 346, which however appear 
from Skolem 1939 and Peter 1940 to be derivable when suitable primitive 
recursive functions with their recursion equations are adjoined. 

All such consistency results obtained by strictly elementary methods 
must stop short of giving the consistency of the number-theoretic 
formalism with the unrestricted induction schema, as we know from the 
famous second Godel 1931 theorem (Theorem 30 § 42), according to which 
the consistency of that system cannot be established by methods 
formalizable in the system itself. 

Gentzen’s consistency proof for number theory. We shall now 
give a brief heuristic account of the method used by Gentzen (1936, 
1938 ) in a proof of the consistency of classical pure number theory with 
the unrestricted induction postulate. 

In the proof of Gentzen’s 1934-5 Hauptsatz (Theorem 48 § 78), 
we used a triple induction, consisting of an induction on the number of 
mixes, within the induction step of which (in establishing the principal 
lemma) we used an induction on the grade, within the basis and induction 
step of which we used an induction on the rank. This triple induction can 
be regarded as a single “transfinite induction”, if our system of ordinal 
numbers, hitherto consisting simply of the natural numbers, is extended 
sufficiently into the transfinite. 

Beyond the natural numbers or "finite ordinals” a next number or 
"first transfinite ordinal” called w is supplied. Then new numbers 
o>+1, 0 + 2 , ... are obtained by use of the successor operation, after the 
infinity of which still another called 2co shall follow. Repeating this 
process, after all these infinite sequences of numbers, each isomorphic to 
the natural number sequence, and starting respectively with 0, w, 2a>, 
still another number called co 2 shall follow; and so on, thus. 
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0 , 1 , 2 , ...; co, a)-f- 1 , M-f- 2 , . ..; 2 m, 2 a>-(- 1 , 2 co-j- 2 , .. . ; ... ; ; 
co 2 , m 2 +1, co 2 +2, . . . ; m 2 4-m, m 2 +m+ 1, m 2 +m+2, . ..; 
m 2 +2m, co 2 +2(o-}- 1, <o 2 -f 2co+2, . . . ; .. . ; ; 

2co 2 , 2 m 2 +1, 2m 2 +2, . .. ; 2m 2 +m, 2m 2 +m+1, 2m 2 +m+2, . . . ; 

2m 2 +2m, 2m 2 +2m+1, 2m 2 +2m-F2, . . . ; . . . ; ; . . . ; ; ; m 3 ,_ 

This figure is intended only to suggest the manner of generation and 
the notations, up to a certain point. The general theory of transfinite 
ordinals forms a part of Cantor’s abstract set theory (cf. 1897). A set 
linearly ordered (§ 8) so that each non-empty subset has a first element is 
called well-ordered. Two ordered sets M and N are similar (M ~ N), if 
they can be put into a 1-1 correspondence preserving the order. Cantor’s 
ordinal numbers arise by abstracting from well-ordered sets with respect 
to similarity, in the same way as his cardinals arise by abstracting from 
sets with respect to equivalence (§ 3 ). However, while the whole system of 
Cantor’s transfinite ordinals requires a set-theoretic approach, the theory 
of initial segments (at least of not too great ones) can be handled in a 
finitary way. 

For example, the system of the ordinals < m 3 can be represented as the 
triples of natural numbers, in a certain ordering. Let a — (a, b, c) = 
aix> 2 -\-bu>-\-c be any such triple; the last notation is the one customary 
in the theory of ordinal numbers < m 3 . The order relation between two 
such triples (as ordinals < m 3 ) is defined thus: 

09 < a 2 = («!< a 2 ) V (a ± =a 2 & b x < b 2 ) V {a l —a 2 & b 1 =b 2 & qcc^). 

In other words, the ordering of the triples (a, b, c) is alphabetical, with 
an infinite alphabet consisting of the natural numbers. 

Now the triple ordinary induction on number of mixes a, grade b and 
rank c can be considered as a single transfinite induction with (a, b, c) 
— aM 2 +&M+c as the induction number, thus. In a transfinite induction 
up to oj 3 , to prove that all ordinals < m 3 have a property, one shows 
that, for any ordinal a < m 3 , if all ordinals [} < a have the property, 
then a has the property also. We are using the more compact statement, 
in which basis and induction step are combined (cf. * 162 b § 40 for this 
form of the statement of ordinary induction). The case a = 0 , for which 
the set of the (3’s is empty, can be treated separately as the basis, if one 
wishes. The induction is of the course-of-values type, where now whenever 
a > m, there are an infinitude of preceding p’s. For the Gentzen Hauptsatz, 
the reasoning is that, if the theorem is true for all proofs with induction 
number (a, b, c) = (3 < a (< m 3 ), it is true for a proof with (a, b, c) = a. 
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The induction number a = (a, b, c) now well-orders the cases of the 
theorem in an order in which they are being proved, just as the natural 
number used as induction number does in an ordinary induction. 

Conversely, using the definition of ordinals < od as triples (a, b, c), 
every transfinite induction up to w 3 can be accomplished by ordinary 
inductions. 

One can define in a finitary manner somewhat higher ordinals. Clearly 
we can go up to « n for any finite n. After all these ordinals as next we 
take co<*>; eventually a>w w ; and so on. 

Gentzen’s discovery is that the Godel obstacle to proving the con¬ 
sistency of number theory can be overcome by using transfinite in¬ 
duction up to a sufficiently great ordinal. His transfinite induction is 
up to the ordinal called e 0 by Cantor, which is the first ordinal greater 
than all the ordinals in the infinite sequence w, <o», .... It figures 

in Cantor’s theory as the least of the solutions (called e-numbers) for \ of 
the equation = £. 

Gentzen works with systems of his type with sequents, but they have 
Hilbert-type equivalents. In the 1938 version of his consistency proof, 
the simple consistency of the system is identified with the unprovability 
of the sequent —►. For from —► any sequent can be deduced by thinnings, 
while conversely from and the provable sequent A & - 1 A —*• 

one can infer -► by a cut. He begins by correlating to each proof in 
his system an ordinal number < e 0 . Taking the induction in the form of 
an infinite descent (cf. *163 § 40), he shows that, given any proof of the 
sequent another proof of with a lesser ordinal can be found. So 
his system (and hence ours) is simply consistent. 

In the 1936 * version, his sequents each have exactly one succedent 
formula. He shows, by transfinite induction on the ordinal (<e 0 ) of a 
proof, that a certain kind of reduction can be carried out on any provable 
sequent. It is absurd that this reduction should be performable on 
-*■ 1 = 0 . The performability of this reduction is offered as a finitary 
meaning which can be attributed to the ideal statements of classical 
number theory (§ 14). 

Ackermann 1940 uses transfinite induction up to e 0 to carry through 
a proof of the consistency of elementary number theory in another 
manner using Hilbert’s e-symbol (originally proposed by Hilbert, and 
carried by Ackermann in 1924-5 as far as showing the consistency 
with the restricted induction schema). 

Just as transfinite induction up to to 3 can be reduced to ordinary in¬ 
duction, so can induction up to e 0 , as is done formally by Hilbert 
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and Bernays 1939 pp. 360 ff. But there is the difference that in reducing 
the latter a predicate is used as the induction predicate (the A(x ) of (I) 
§ 75, or the P(n) of § 7) into which there enters explicitly a predicate Q 
defined by an induction similar in nature to that defining M in Theorem 
VIII § 57 and hence probably not arithmetical (§ 48) or equivalently not 
“elementary” (Theorem VII § 57). It can in fact be anticipated from the 
second Go del theorem (Theorem 30) that transfinite induction up 
to e 0 cannot be reduced to ordinary induction within the system, since 
the reasonings used in Gentzen’s consistency proof other than this trans¬ 
finite induction are of sorts that are formalizable in the system (whence 
in particular no formula of the system can satisfy the equivalences 
defining Q). In his last paper 1943 , Gentzen proves the non-reducibility 
of induction up to e 0 directly, instead of indirectly from Godel’s theorem 
with his consistency proof. 

The original proposals of the formalists to make classical mathematics 
secure by a consistency proof (§§ 14, 15) did not contemplate that such 
a method as transfinite induction up to s 0 would have to be used. To 
what extent the Gentzen proof can be accepted as securing classical 
number theory in the sense of that problem formulation is in the present 
state of affairs a matter for individual judgement, depending on how ready 
one is to accept induction up to e 0 as a finitary method. (Cf. end § 81.) 

Gentzen in 1938a speculates that the use of transfinite induction up 
to some ordinal greater than s 0 may enable the consistency of analysis 
to be proved. By a result of Schiitte 1951 , stronger forms of induction 
are obtainable thus; in fact for any ordinal a, induction up to the least 
Cantor e-number greater than a cannot be reduced to induction up to a 
(but induction up to any intermediate ordinal can be). 

§ 80. Decision procedure, intuitionistic unprovability. Given 
the conclusion B of an inference by the modus ponens rule (Rule 2) of 
the formal system H, we cannot determine the premises A and A D B, 
because the A will be unknown. Similarly, given the conclusion 
A, T-» A, 0 of a cut in G\, and the analysis of the conclusion specifying 
how its antecedent is separated into the A and the T and its succedent 
into the A and the 0, we cannot determine the premises A —> A, C and 
C, T 0, because the C will be unknown. 

However, for each of the rules of the propositional calculus G 1 except 
the cut (or of G 2 except the mix), given the conclusion of an inference 
by the rule and the analysis of the conclusion, the premise(s) for the 
inference are ascertainable. Using this fact with Gentzen’s normal form 
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theorem (Theorem 48), we shall obtain a decision procedure for the prop¬ 
ositional calculus, which unlike the truth-table procedure (§§ 28—30, 
Theorem 12) works also for the intuitionistic system as well as for the 
classical. 

The steps in the procedure will consist in listing the choices of the 
premise(s) for the inference of a given conclusion. In doing this, it is 
tedious to have to distinguish all the ways of applying the structural 
rules TCI. Therefore, for use in our version of Gentzen’s decision proce¬ 
dure, we shall introduce a new Gentzen-type system G3, in which the 
structural alterations TCI are not counted as separate inferences. We 
define G3 for the predicate calculus also, although it is only for the 
propositional calculus that we shall have a decision procedure. 

In order in G3 to dispense with the TCI rules, we must construe the 
postulates of G3 to apply irrespective of the order and number of repeti¬ 
tions of formulas in the antecedents, and classically in the succedents. 
In other words, for G3 any postulate application shall remain an 
application of the same postulate when any sequent is replaced by a 
sequent ‘cognate' to it in the following sense: Two sequents T—0 
and r-*0' are cognate, if exactly the same formulas occur in T 
(in 0) as in T' (in 0'), provided intuitionistically that 0 and 0' neither 
consist of more than one occurrence of a formula and are hence the same. 

Example 1. The sequents C, Cl, S & <£?, CZ-* S and 2 & d, Cl, C-> 3 , 8 
are cognate classically, but the latter sequent is not employed intu¬ 
itionistically. 

For the classical system G3, the postulate list differs from the list given 
for G2 thus. The axiom schema is replaced by 

C, r-*0,C. 

There are no structural rules of inference; and each logical rule of 
inference is modified by retaining the principal formula in the premise(s). 
For example, -*• -i, D -» V and -i —► become: 

A, r-*@, -i A A D B, T 0, A and B, A D B, T 0 

T“►©,-!A. A DB,T0. 

r -*• 0, A V B, A or T -► 0, A V B, B -iA, T-^0, A 

F-*0, A V B. -i A, T-*0. 

(For D both premises are to be used; for —*■ V, written now combining 
the two rules into one statement, one or the other.) 

For the intuitionistic G3 the postulate list is as follows. 
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Postulates for 

the intuitionistic formal system G3 

Axiom schema. 


c, r C. 

Rules of inference for the propositional calculus. 

A, r-*B 

A D B, T A and B, A D B, T -► 0 

r^ADB. 

ADB,r-»0, 

r -> A and T -*• B 

A, A & B, T -*■ 0 or B, A & B, T -► 0 

T-»A&B. 

a & b, r-*@. 

r-*A or r B 

A, A V B, r -*• 0 and B, A V B, T 0 

r-»AVB. 

AVB, T-*0. 

a, r-* 

-iA,r-*A 

r-vnA. 

~iA, r-^0, 

with 0 empty or 
consisting of one formula. 

Additional rules of inference for the predicate calculus. 

r A(b) 

A(t), VxA(x), T-*0 

f VxA(x), 
subject to the 
restriction on variables. 

VxA(x), r-*0. 

r -► A(t) 

A(b), 3xA(x), T-» 0 

r 3xA(x). 

3xA(x), T -► 0, 
subject to the 
restriction on variables. 

We also define classical and intuitionistic systems G3a. These differ 
from the systems G3 in that we permit arbitrary omissions of formulas 
in the antecedent and succedent of the premise(s) for an inference by 
any one of the rules. 

The system G3 is designed to minimize the number of choices of prem¬ 
ise^) for a given conclusion, when we are attempting to exhaust the 
possibilities for proving a given endsequent, especially in showing the 
endsequent to be unprovable. When the endsequent is provable, the 
use of G3a usually permits shortening the sequents used in the proof. 
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A proof in G3 is irredundant, if it contains no pair of cognate sequents 
one occurring above the other in the same branch. 

Theorem 56. (a) If h T —> 0 in G3a (a fortiori if V T 0 in G3), 

then b T —> 0 in G2 (or G1 ), using exactly the same-named logical rules 
as in the given proof in G3a and no mix (or cut). 

(b) Conversely, if b T—^ 0 in G2 (or G1 ), and no variable occurs both 
free and bound in 0, then b T -*► 0 in G3 (a fortiori in G3a), using 
only rules the same-named as logical rules used in the given proof in G2 
(or Gl). Lemmas 32a—33b (stated above for G\ and the cut, and for G2 and 
the mix) hold also for G3 (and G3a). 

(c) A formula E containing no variable both free and bound is provable 
in H, if and only if there is an irredundant proof in G3 of the sequent —> E. 

(d) A decision procedure (or algorithm) for determining whether or not a 
proposition letter formida E is provable in the propositional calculus H is 
afforded by the process of attempting to construct an irredundant proof of 
—> E in G3. According as such a proof is found or is determined not to exist, 
E is provable in H or is not provable in H. 

Proofs, (a) An axiom of G3a is provable by TI steps from an axiom 
of G2. Given any inference in G3a, by TCI steps we can bring its premises 
to the standard form shown in the postulate list for G3. Then the cor¬ 
responding rule of G2 applies, with the given conclusion or a conclusion 
which leads to it by TCI steps. 

Example 2 . The inference in the intuitionistic G3a shown at the 
left is then accomplished in G2 as shown at the right. 

B, -Ie3 C. ^ “I S —► <37 

*- “i “ o 

-Ic si, “1 B —*■ 

(b) By Theorem 48 (and Lemma 34), we can take the given proof 
to be in G2 without mix. An axiom of G2 is an axiom of G3 ; and we easily 
verify that any inference in G2 without mix can be performed (by one 
or more-steps) in the system obtained from G3 by adding the six TCI 
rules. Hence it will suffice to show that these additions to G3 do not 
increase its class of provable sequents. For this purpose, we first show by 
induction that, if b T 0 in G3, then |- T —> 0, C in G3, provided 
intuitionistically that 0 is empty, i.e. we establish that -*■ T holds as a 
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derived rule for G3. In this induction, the restriction on variables for an 
-*• V or 3 -> is met by first using Lemma 35 (which holds for G3 as well 
as for G1 and G2) to change the b in its free occurrences in the given 
proof of the old premise to a new variable not occurring free in C. The 
rule T —► is handled similarly; and —*■ C, C ——*■ I, I —*■ immediately 
by the convention under which the rules of G3 are used. 

(c) By (a), (b) and Theorems 46 and 47, E is provable in H, if and 
only if there is a proof of —► E in G3. But given any proof of E in 
G3, we can find an irredundant one, as we show by induction on the 
number of pairs cf cognate sequents one above the other in the same 
branch. Given such a pair, the lower sequent of the pair and the in¬ 
tervening sequents, together with all branches contributing to either, 
can be suppressed. 

Proof of (d) is postponed until after the following example illustrating 
the decision procedure. 

Example 3. (a) Is GiV —\Gl provable in the intuitionistic prop¬ 

ositional calculus H ? By (c), it is if and only if there is an irredundant 
proof of -*• G7 V -i G? in the intuitionistic G3. We attempt to find such a 
proof thus. The sequent G7 V -i G?, as we see by inspection, is not an 
axiom of G3. The only rule of inference of G3 applicable with —G7 V -iG7 
as conclusion is —V. Intuitionistically the inference of —*G7V-iG7 
by this rule can have as premise only —► G7 or -iG7. Neither is an axiom, 
the first —G7 can be the conclusion of no inference in G3, while the second 
— > —i G? can only come by -i from the premise G7—or a premise 
such as G7, G7 —> cognate to G? However since two cognate sequents 
are interchangible for proofs in G3, it suffices to consider G7 —This 
sequent is not an axiom, and no inference of G3 is possible with it as 
conclusion. The entire construction is shown below in the figure, on three 
lines or levels numbered upward from the given endsequent —G7 V -i G7. 
Briefly, we apply the rules of G3 upward from conclusion to premises, 
in all possible ways not distinguishing between cognate sequents. 

3. d —► 

2 . —^ G7 or —*■ —i d 

1. — > G7 V —i G7. 

In this construction, we have exhausted all the possibilities for finding 
a proof of —■►c^?V-iG? in the intuitionistic G3, without finding one. 
Hence G7V-i<G7 is not provable in the intuitionistic II. 
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(b) We already know that <37 V -i <37 is provable in the classical 
propositional calculus H (*51 § 27). However it is of interest to see 
how the decision procedure leads us to a proof of <37 V -i c37 in the 
classical system G3, whence if we followed out the proofs of Theorem 56 
(a). Theorem 47, and of results in Chapter V on which the latter depends, 
we should be led to one of <37 V -j <37 in the classical H. At Line 2, the 
formula c3Vi<37 is now retained in the succeedent (owing to the dif¬ 
ference between the classical and intutionistic -*■ V rule of G3). This gives 
us more possibilities at Line 3. The “ 2 ” written there signifies that the 
sequent listed below it at Line 2 is one of the possible premises at Line 3 
(by —* V). As we are seeking an irredundant proof, we need consider 
further only the new premise —► <37 V -«c7, <37, -i c7 or <37 —* c37 V -i <37, 

-i <37. The construction is shown below up to Line 4. 

4. 3 or <37—>-<37 V —1<37, <37, —1<37 3or<37 —»-<37 V-i<37, <37, — i<37 3or<37—*-<3? V-i<37, <37, -i<3? 

3. 2 or —*■ <37 V“i(3?, <37, -i <37 2or —* <37 V — i<37, <37, —t <37 or <37 —*■ <^7 V —i<3?,- 1 <37 

2. —»• <37 V -1 <37, <37 or —► <37 V -1 <37,-i <3? 

1. —»• <37 V — i <37. 

At Line 4, three of our series of choices have ended above in an axiom; i.e. we 
have discovered proofs of <37 V -t <37 in G3. Say using the left choice at 
Line 2 (and the new choices above that), we have in particular the following 
proof in G3 (left). The sequents of this can be simplified in G3a (right). 

4. <37 —* <37 V ~i <37, <37, -i <37 4. 

3. <37 V-i<37, <37,-i <37 ^ 3. 

2. —► <37 V -i <37, <37 ^ 2. 

1. —► <37 V -i <37. * 1. 

In terms of G 2 (or Gl) the latter becomes: 

etc. 

3. —*• <37 V -i <37, <37 

2. —*-c7V-i<37 ( c7V-'<37 

1. —*• <37 V ~i <37. 

(c) We already know that -i-i (<37 V -i <37) is provable intuitionistically 
in 7 / (*5la). A proof of -*• -i-i(<37 V -i<37) in G3a follows which is dis¬ 
covered by use of the decision procedure. In terms of G2, the contraction 
which we could not perform in (a), because of the intuitionistic restriction 
to not more than one formula in the succedent, is possible now as it is 
performed in the antecedent. 


V 

-»c 


<37 —► <37 


<37, -i <37 


<37 V-«<37, 


<37V-i<37. 


V 

V 
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7. 

6 , 

5. 

4. 

3. 

2 . 

1 . 


-* V 

<3 -> V —1 (C? 

<3, -I (c^? V “I <3) —*• ~^ 

-I(e2 w 

-i(J?V-i«SQ-* 

—* -i-i (G? V —i <C?). 


Proof of Theorem 56 (d). It is clear that the procedure, as il¬ 
lustrated in Example 3, can be completed up to any desired level, noting 
the following fact. Given any sequent T -*■ © composed of proposition 
letter formulas, to each choice from T or from 0 of a formula containing 
a logical symbol to serve as the principal formula, there are exactly one 
or two incognate choices of premise(s) for the inference of T 0 in G3. 

It remains to prove that the whole procedure must terminate. By the 
subformula property of G3, every sequent in a proof of -► E in G3 must 
be composed of subformulas of E. But a proposition letter formula E has 
only a finite class of subformulas; and from these there are only a finite 
number k of ways of choosing formulas to occur in antecedent and to 
occur in succedent, i.e. at most k incognate sequents can be written down 
formed of subformulas of E. Therefore an irredundant proof of —► E 
cannot exist having more than k levels; so we can exhaust the possibilities 
for finding such a proof by completing the procedure up to (at most) 
the A-th level. 

Example 4. Is V <£7 2 ) 3, -i-ic^ V -i-i<£7 2 provable in the 

intuitionistic propositional calculus? We start out as follows. 

3. Vc^fj)—► —i(<£ 7 1 V<C? 2 ) or-i-i (<£7 x Vc3f 2 )—► -i-ic^or —i-i(<C7 1 V <^? 2 )—* —i —i 

2. -i—i (<s3fj V G7 2 ) -i-i V -i-i G? 2 

1. —►“i-i(e^f 1 V <CT 2 ) 3 -i-ic^fj V-i-ic3f 2 . 

Without carrying this procedure further, we can now answer the question 
in the negative. For we can quickly verify by 2-valued truth table methods 
that none of the sequents at Line 3 is provable. Thus the first of them 
-i-i (<£7 X V c 7 2 ) -*• -i (<^ x V <C? 2 ) is only provable in G3 and hence (Theorem 
56 (a)) in Gl, if -i-i(G? 1 V <C7 2 ) 3 -i(<3 1 V <C? 2 ) is provable in H (Corollary 
Theorem 47). But-i-i (G? x V <C7 2 ) 3 -i (<CT X V <C? 2 ) is not provable in H, since 
it assumes the value f when <C? 2 take the values t, t (Theorem 9 § 28). 
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Theorem 57. (a) In the intuitionistic propositional calculus H, for 

any formulas A and B: b A V B, only if (- A or (- B. (Godel 1932 .) 

(b) Each of the numbered results *14, *15, *49, *51, *52, *55—*62, 
which were established in Chapter VI only for the classical propositional 
calculus, does actually fail to hold for the intuitionistic propositional cal¬ 
culus (and likewise the converse of each implication among *49a—*62a). 

Proofs. Part (a) is immediate from the form of V in G3. For (b) 
we can show by the decision procedure that each formula in question 
is unprovable when A and B are simple proposition letters d and 2, 
as we have already done for *51 in Example 3 (a). (For *51 the unprov¬ 
ability also follows from Part (a) with Theorem 9.) However for the 
others it is more expeditious to take advantage of deductions in the 
intuitionistic propositional calculus. For example, if *49 held, by Remark 
1 § 27 so would *51, contradicting our result for *51. The others we 
consider in their numerical order. 

*14. Suppose -iA 3B b nB D A did hold in the intuitionistic 
propositional calculus for all formulas A and B. Then in particular we 
would have -yd'D—yd \- -y-yd d, whence using *1 we would 
get b -1 “i d ^ d, contradicting our result for *49. 

*56. (Cf. *56a.) If b - i(iA& - iB) 3 AVB intuitionistically for 
all A and B, then in particular b — 1 (— 1 & —1 Cl) "D d\ d, whence 
by *37 and *38 again b ~~y~id D Cl. A fortiori by &-elim., not 
b AVB~-i(-iA&-iB) intuitionistically for all A and B. 

*62. (Cf. *62a.) If b “i(A & B) D -iA V-iB intuitionistically for 
all A and B, then in particular b ~i {Cl & -1 Cl) z> -1 d V - 1-1 Cl , whence 
by *50, b -yd V -y-yd. By (a) of the theorem, then either \- -id or 
\- -[-yd, contradicting Theorem 9. 

Remark 1 . Likewise the theorems and corollaries of Chapter VI 
marked with 0 as being established only classically can (with one ex¬ 
ception) be inferred by the present methods to fail intuitionistically. 
Thus Theorem 8 would give *49; its corollary would give *55 from *54; 
Theorem 11 would give the equivalence of m Cl to one of d V -1 d, 
d, —yd and d&-yd] etc. Exception: Theorem 12 is obviously false 
for the intuitionistic propositional calculus in the sense that the in¬ 
dicated procedure does not apply. That no other truth table procedure 
with finitely many values applies was shown by Godel 1932 . There does 
exist a decision procedure of another kind (Theorem 56 (d)). 
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Predicate calculus. By Theorem 54 § 76, there is no decision pro¬ 
cedure for the predicate calculus. (At what point does the proof of 
Theorem 56 (d) fail, when we attempt to apply it to the predicate cal¬ 
culus? Contrast Examples 2 and 3 (a) §78.) Although Theorem 56 (c) 
thus does not afford a decision procedure in the case of the predicate 
calculus, nevertheless it is useful in investigating provability in the pred¬ 
icate calculus. Given a predicate letter formula E, by attempting to 
find an irredundant proof in G3 of E, we may actually find one, or 
we may discover some feature of the situation which shows that there 
cannot be any. 

Theorem 58. In the intuitionistic predicate calculus H: 

(a) “v-i Vx(J?(x) V -ic 5I(x)) is unprovable. (Heyting 1930 a; Kleene 1945 
with Nelson 1947 .) 

(b) (i) Vx(<£7 V £(*)) D d V VxB(x) is unprovable, 

(ii) -i-i{Vx(<3 V B(x)) D Cl V VxB(x)} is provable, but 

(hi) m y{yx{III(y) V B{x)) D Cl{y) V VxB(x)} is unprovable. 

(c) In each table of Corollary Theorem 17 § 35, when A(x) is the simple 
predicate letter Cl{x), the implication {and hence the equivalence) of a formula 
above a line by {to) one below the line is unprovable, and likewise the double 
negation of that implication {and hence, by *25, of that equivalence) when a 
double line separates the formulas. (Heyting 1946 .) 

(d) Each of the numbered results *83—*85, *92, *97—*99 {Theorem 17), 
which were established in Chapter VII only for the classical predicate cal¬ 
culus, does fail to hold. Of them, *83, *92 and *97 hold, but *84, *85, *98 
and *99 fail to hold, when double negation is applied to the formula. 

Proofs, (a) We attempt to construct an irredundant proof of 
-+■ -i-\Vx{Cl{x) W-\Cl{x)) in the intuitionistic system G3 as follows. 
For abbreviation, we let “B” stand for Vx(<C?(x) V -1 <£?(*)) at certain 
places. From Line 3 to 4, either -iB or B can be the principal formula 
of the inference. If — 1 B (using -1 —►), the premise is what is already ob¬ 
tained at 3 (the original B disappearing as the 0 and a new occurrence 
of it appearing as the side, formula). For the definition of what constitutes 
a proof of a given sequent in G3, all variables not occurring in the sequent 
are on a par; hence at Line 4 it suffices to list the premise for the -*• V 
with the particular variable b v Then similarly at Line 8 we choose 
another particular variable b 2 , which must be distinct from b x to satisfy 
the restriction on variables for the —► V; etc. 
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* 11 . 

10 . 

9. 

8 . 

*7. 

6 . 

5. 

4. 

*3. 

2 . 

1 . 


iB, afa), c b 

7 7 or -iB, <3(£ 2 ) -> 

7 or -iB, cT^) -► <3(/> 2 ) or -.B, 

7 or 

-iB, ^{bj-* B 
3 3 or iB, —► 

3 or —iB -*■ <C?(Zq) or —iB —*■ -i ^(bj) 

3 or nB-^^V-i^j) 

-iB—*• B 
-iB 


From the portion of the structure shown, the general pattern is clear. 
We fail to have an axiom in the first new sequent -iB, ^(bj) —*■ cT(/> 2 ) 
of Line 9 because b x and b 2 are distinct variables; similarly in the first 
new sequent -iB, cC?(/q), <^(b 2 ) —> <07(i» 3 ) of Line 13; and so on. All other 
sequents occurring are even more obviously not axioms. Note the form 
of Lines 3, 7, 11, ...; at Line 3+4n for n = 0, 1,2, ... we have closed 
out all the possibilities for finding an irredundant proof of —*--i-iB 
other than by finding one of -iB, eT(/q), ..., GZ(b n ) —*■ B with distinct 
variables b v ..., b n . Thus the search for an irredundant proof of —*■ -i-iB 
in G3 will never terminate successfully, but will only lead us upward 
through an infinite regression of increasingly complicated sequents. 
Otherwise expressed, we establish that there is an irredundant proof 
of -* Ti B in G3, only if there is a shorter one of -»B -*■ B, a still shorter 
one of -iB, <3(/q) -*■ B, a still shorter one again of -iB, GZ{b^), <3(b 2 ) -*• B, 
ad infinitum. Since it is absurd that there should exist an infinite suc¬ 
cession of successively shorter proofs starting with a first proof, we 
conclude that there is no proof of —> —i—i B in the intuitionistic system 
G3; and hence by Theorem 56 (c), none of —i—i B in the intuitionistic 
predicate calculus H. 

(b) (i) We attempt to construct a proof in G3, with the first step 
uniquely determined thus: 

2. V*(e2 V B(x)) <C* V VxB(x) 

1. Vx(<3 V S(x)) 3 cT V VxS(x). 


From the form of the sequent at Line 2, and of the rules of G3, we see 
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that in any sequent above the bottom one only formulas of the four 
forms y ix{ZZ V S(x)), ZZ V B(t) (t a term, i.e. for the pure predicate cal¬ 
culus, a variable), ZZ and B(t) can occur in the antecedent, and only a 
formula of one of the four forms ZZ V VxB(x), ZZ, VxB(x) and B(b) (b a 
variable) as the succedent. The only chances for such a sequent to be an 
axiom are for it to have the form ZZ, T -» ZZ or the form B(t), B(b) 
where t is b. Whenever the two-premise rule V —is applied, the tree we 
are constructing will branch. The other rules which there is a possibility 
of applying after the first step are the one-premise rules V —-*• V and 
—V. In order to have found a proof, for some succession of choices, 
every branch must be terminated above in an axiom. We shall now 
show that, no matter what succession of steps has been performed, an 
axiom will not have been reached along one of the branches. For this 
purpose, we define the designated branch (along which we are to show 
that an axiom cannot be reached) by specifying which premise belongs 
to it at each V thus. Let the principal formula for the V -> be V B(t). 
If the formula of the succedent contains <0% as whole or part, the premise 
in the designated branch (or designated premise) shall be the one with 
B(t) as the side formula; otherwise the one with ZZ as the side formula. 
Now consider the following property P of a sequent, namely that (1) ZZ 
does not occur as one of the antecedent formulas, if the succedent formula 
contains ZZ as whole or part, and (2) for every variable b, B(b) does not 
occur as one of the antecedent formulas, if B(b) (for the same b) is the 
succedent formula. Neither kind of axiom described above has property 
P. Hence, to prove that an axiom cannot be reached along the designated 
branch, it suffices to show that every sequent in the designated branch 
has property P. This we do by induction. For, first, the sequents at Lines 
1 and 2 have property P. Thereafter, as we shall verify next, each in¬ 
ference will preserve property P along the designated branch, i.e. if the 
conclusion of the inference has property P, so does the premise, or in 
the case of an V -*• the designated premise. To verify this, we must 
examine four cases, according to the form of the principal formula of the 
inference. Case 1: 'ix{ZZ V B(x)) in the antecedent. Property P is obviously 
preserved, since the side formula introduced into the antecedent by the 

V ->• is of the form ZZ V B(t) and the succedent is unchanged. Case 2: 
ZZV B(t) in the antecedent. If the succedent contains ZZ as whole or part, 
the introduction of B(t) as side formula for the designated premise of the 

V —>■ preserves property P. If the succedent does not contain ZZ as whole 
or part, the introduction of ZZ as side formula for the designated premise 
preserves property P. Case 3: <CTV VxB(x) as the succedent. Since the 
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conclusion has property P, the formula £3 does not occur among the 
antecedent formulas, by (1). Then the introduction of either ^3 or 
VxB(x) into the succedent as the side formula of the —*■ V preserves 
property P. Case 4: VxB(x) as the succedent. By the restriction on 
variables for the -> V, the variable b of the side formula B(b) must be a 
variable not occurring in the antecedent, which assures that (2) for prop¬ 
erty P remains satisfied. 

(ii) Now consider instead -n{Vxfc3 V B(x)) D ^3 V 'ixB(x)}] call 
it “- 1 - 1 C”. We may now have in antecedents of sequents above the 
bottom line also the formula ~iC, as succedent also C. The induction for 
property P fails in the case for -iC in the antecedent, as the -i —*■ can 
introduce C as the succedent after has previously been introduced 
into the antecedent, leading to a violation of (1). By following out this 
loophole in the previous demonstration of unprovability, we are led 
to the following proof of -*■ -i-i{Vx(c^? V B(x)) D cA V VxB(x)}, which 
we state in G3a. The designated branch, which except by using -iC 
as the principal formula for an -i —we would be unable to terminate in 
an axiom, is the left one. 


11 . 

10 . 

9. 

8 . 

7. 

6 . 

5. 

4. 

3. 

2 . 

1 . 


a-* a 


VxB{x) 


a v*(e2 v B(x)) d a v v*B(*) 


—i C, cA ■ 




-iC , ^3 W B(b)B(b) 
nC,Vx(^VS(j))-»S(i) V ^ Y 
-.C, V*(c7 V B(x)) VxB(x) 

■ C, Vx(<^ V B(x)) -+ <A V Vxff(x) ^ 


C. 


(iii) If we now change the formula to 
—nV_y{Vx(e^r(_)i) V B(x)) Di3(y) V VxS(x)}, the loophole is closed. For 
the €3 obtained in the antecedent before the -i —*■ becomes <A(c) for some 
variable c. Then the £3 obtained in the succedent after -i —=► followed now 
by -*• V with respect to y will become <A( d) where d is a variable distinct 
from c by the restriction on variables for the —*• V. The reader may work 
out the modifications in the demonstration given in (i) to establish the 
unprovability rigorously. 
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(c) For a given table, consider formulas A, B, C, D, where B is A 
or below A, C is immediately below B and separated from it by a line, 
and D is C or below C. Using *2, if 1- A 3 B and \- C 3 D, but not 
h COB, then not (- D 10 A. Similarly, using *24 and *49a, if |- A 3 B 
and |- C OD, but not b —•—«(C 3 B), then not |- -i-i(D 3 A) and 
not b D 3 A. Hence it suffices to treat the upward implications between 
the six pairs of formulas immediately separated by a line, under double 
negation when the line is double. 

Ib 3 la, lib 3 Ila. If either were provable, by substituting for 
<£7(x) (Theorem 15 § 34) and using *75 or *76, or by using Theorem 22 
§ 37 for k = 1, -!-!<£? 3 <£? would be provable. Hence not b Ib 3 la 
and not b Hb 3 Ila. 

-1-1(10! 3 Ib). Let “B(x)” abbreviate <C7(x) V -i<b7(x). By *51a and 
V-introd., b Vx~i"iB(x). Hence Vx-i-iB(x) 3-i-iVxB(x) b ”i-iVxB(x). 
Thence by 3-introd. and contraposition twice (*13, *12), 
b -i-n{Vx-i-iB(x)3 -i-iVxB(x)} 3 -i-iVxB(x). Thus if 
—i-i{Vx -i-iB(x) 3 -i-iVxB(x)} were provable, -i-iVxB(x) would be. 
But by (a), not b “i“iVxB(x); hence not b -i“i{Vx“i“iB(x) 3 
-i-iVxB(x)}; and hence by the substitution rule (Theorem 15) not 
b -!-i{Vx“i-icC¥(x) 3 -i-iVx<3(x)}, i.e. not b -1-1 (Ic 1 ^ Ib). 

—i—i (IIIc 3 IIIb 2 ). Easily reduces to -1-1(10! 3 Ib). 

IIcj 3 lib. By Example 3 § 37 and Example 4 this section. 

Illbi 3 Ilia. Easily reduces to Ilcj 3 lib. 

(d) *83—*85, *92. Included under (b) and (c) (with *25, *92a.) 

*97. We show (d 3 3x2(x)) 3 3x(<C? 3 2(x)) unprovable, similarly 

to Ilci 3 Hb. (After applying Theorem 22 with k = 2, we obtain at 
Line 3: and V S 2 , a 3 2 X V S 2 {a 3 2 X ) V 

{a 3 S 2 )} or 3 Si V S 2 -». a 3 S x or <£? 3 Si V S 2 -* a 3 S 2 . In 
treating the first alternative, it suffices of course to show that the first 
premise 3 V 2 2 -► <£? of the two is unprovable.) We prove 
-i-i {{d 3 3xB(x)) 3 3x(<£? 3 S(x))}, similarly to (b) (ii). (In Lines 4 — 9, 
use successively 3, 3, 3-*, -i-*, 3 -».) Cf. *97a. 

*98. Substituting <£?& —i<£7 for S in -i—i{(Vx<v?(x)3$)3 3x(<CT(x)3S)} 
and using *50 and *44, we get —i—i (IIIc 3 Ilia). 

*99. Reduces to *98 (cf. the first method for Ib 3 la). 

Heyting (1930a p. 65) infers the unprovability of the formula of Theo¬ 
rem 58 (a) and of Ic x 3 Ib from the interpretation of the intuitionistic 
predicate calculus in terms of Brouwer’s theory of sets (end § 13). 
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The proof of Theorem 58 (a) by Kleene 1945 and Nelson 1947 is from 
results to be taken up in § 82. 

The present treatment of (a) and (b) was reported in Kleene 1948 ; 
and some like applications of Gentzen’s theorem are in Curry 1950 
(already in press in 1948). de Iongh 1948 uses the method in establishing 
an intuitionistic classification of those formulas formed from <v7(x, y, z) 
by quantifying x, y, and z and possibly applying negation which are 
classically equivalent to Vx3jyVze2?(x, y, z), analogous to each of the four 
tables of Heyting 1946 for one quantifier (Corollary Theorem 17 §35 
and Theorem 58 (c)). He takes this sequence of quantifiers for illustration, 
because of its role in formulating the notion of convergence of a sequence 
to a limit (cf. § 35 (i) or (ii), omitting the x’s). 

Mostowski 1948 demonstrates the unprovability of — 1 — 1 (Ic x 3 lb), 
and (b) (i), by an interpretation of the intuitionistic predicate calculus in 
terms of “complete Brouwerian” lattices. Henkin 1950 a extends Mos- 
towski’s results to obtain an algebraic characterization of quantifiers 
both for intuitionistic and for classical logic. 

§ 81. Reductions of classical to intuitionistic systems. For the 

rest of this chapter, we use the Hilbert-type systems H. When T is a 
sequence of zero or more formulas, -iT, — 1 — 1 F, r°, etc. shall be the result 
of applying - 1 , - 1 - 1 , 0 (as defined below), etc., respectively, to each of the 
formulas of T. 

The main result of the first part of this section is given in several 
versions, though its significance can be seen from one. The reader desiring 
a simplified treatment may accordingly select: Theorem 59 and Proofs, 
Definition and Discussion of °, Theorem 60 (a) for 0 only and (c), Lemma 
43a and Proof, Proof of Theorem 60 (c), Corollary 2 (omitting the other 
material up to that point). 

Theorem 59. (al) If T (- E in the classical propositional calculus, 
then —1 —1 r p - 1 - 1 E in the intuitionistic propositional calculus. (a2) If 
— 1 P, A p -iE in the classical propositional calculus, then— iT,- 1 - 1 A p -iE 
in the intuitionistic propositional calculus. (Glivenko 1929 .) 

(b) Likewise for the predicate calculus with Rule 9 omitted, and for the 
formal number-theoretic system with Rule 9 omitted. 

Proofs. ' (al) By induction on the length of the given classical de¬ 
duction T p E (i.e. the deduction of E from T which ‘T p E” asserts 
to exist, cf. § 22), using the following observations. If E is an axiom of the 
classical propositional calculus by any axiom schema except 8 , then E 
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is also an axiom intuitionistically, and by *49a §27, |- -i-iE in the 
intuitionistic system. If E is an axiom by the classical Axiom Schema 8, 
then by *5lb, again b -i-iE intuitionistically. Moreover, corresponding 
to Rule 2, -i-iA, -i-i (A 3 B) b niB intuitionistically, using *23 § 26. 
(a2) From (al) using *49b. 

(b) Because the additional axiom schemata and particular axioms 
belong to the intuitionistic as well as to the classical system, we need 
only add a treatment of the additional rule of inference 12. Using *23, 
Rule 12 and *49a: (i) t-i(A(x) 3 C) b _ i _ iA(x) 3 —i — iC b x 3x~i“«A(x) 
3 -1 —iC b “i“i (3x — i—iA(x) 3 -l- iC). Using *49a, *70 § 32 and 

*49a: (ii) b -i-i(3xA(x) 3 3x-v-iA(x)). By *51 b: (iii) b 3 C). 

Combining (ii), (i) and (iii) by *24, -i-i(A(x) 3 C) b x ■ _ i“>(3xA(x) 3 C). 

Example 1 . By (al), each of the numbered results which were 
established in Chapter VI only for the classical propositional calculus 
(see Theorem 57 (b)) holds intuitionistically under double negation 
(applied in *14 and *15 to both formulas). 

Example 2. That *97 holds intuitionistically under double negation 
(which we proved by another method for Theorem 58 (d)) now follows 
from (b) and Theorem 49 §78. 

Corollary (to (a2)). If E is a proposition letter formula containing 
no logical symbols except & and —i, and b E in the classical propositional 
calculus, then b E in the intuitionistic propositional calculus. (Godel 
I932-3-) 

Proof. Consider E as a conjunction of n formulas (n ;> 1) each 
of which is not a conjunction, and is therefore either a proposition letter 
or begins with the symbol -i. By &-elim., each of these n components is 
provable classically. But no proposition letter is provable (by Theorem 9 
§ 28). So each component is a negation, and by Glivenko’s theorem ((a2)) 
is also provable intuitionistically. Hence by &-introd., so is E. 

Definition of °. For the rest of this section, the formulas T, the 
formula E, etc., shall be proposition letter formulas, predicate letter 
formulas, or number-theoretic formulas, according as we are considering 
propositional calculus, predicate calculus, or formal number theory. 
By a prime part of a formula, we mean a (consecutive) part which is 
a prime formula, i.e. one containing no logical symbol. 

For any formula E, we define E° by the following recursion. 1. If P is 
a prime formula, P° is P. 2 — 5. If A and B are formulas, (A 3 B)° is 
A° 3 B°, (A & B)° is A° & B°, (A V B)° is -i (iA° &-iB°), and (iA)° 
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is -iA°. 6 — 7. If x is a variable, and A(x) is a formula, (VxA(x))° is 
VxA°(x) (where A°(x) is (A(x))°), and (3xA(x))° is -iVx-iA°(x). 

Briefly, E° comes from E by replacing (or “translating”) each part of 
E of the form shown below in the first line by the respective expression 
shown in the second. 

A D B A & B A V B -iA VxA(x) 3xA(x) 

“ “ -»(-.A&-iB) “ “ -iVx-iA(x) 

Example 3. Let A(x) and B be prime (and B not contain x free). 

If E is [VxA(x) D B] D 3x[A(x) Z> B] (cf. *98), then 

E° is [VxA(x) D B] Z> -iVx-i[A(x) Z> B]. 

Discussion of °. In the next theorem it is shown that the classical 
systems can be defined within the intuitionistic. In particular, for the 
number-theoretic system, if f- E classically, then [- E° intuitionisti- 
cally. The converse holds obviously (since h E~E° classically), as do 
the converses for Theorem 59 and the other parts of Theorem 60. So a 
formula E is provable in the classical system, if and only if the correlated 
formula E° is provable in the intuitionistic system. We can think of E° 
as resulting from E by changing the logical symbols D, &, V, -i, V, 3 
to D°, &°, V°, -i°, V°, 3°, respectively, where “A D° B” is an abbreviation 
for AdB, "A V° B” for -i(-iA&-iB), etc. The sense in which the 
classical formulas are thus “translated” into intuitionistic ones can be 
emphasized by using different logical symbols (say D c , & c , V c , -i c , V c , 3 C ) 
for the classical system (upper row in the above translation table). 

Definition of ', etc. For the propositional and predicate calculi, 
we use other correlations. Let E' be obtained from E like E° except that 
A D B is translated as ->(A&-iB). Let E* be obtained from E by re¬ 
placing each prime part P by —i—iP; and Et likewise except that P is 
replaced by —«—i P only where it is alone (i.e. when E itself is P), or im¬ 
mediately within the scope of an & or an V, or is the second part of the 
scope of an D; and E* like Et except without the replacements in the 
second part of the scope of D. 

Example 3 (concluded). 

E' is [VxA(x)&-iB] & t-iVx- v-i [A(x)&-iB]}, 

E c 7 is [Vx -i-iA(x) D —i —i B] Z) -iVx —i [ii A(x) D —i —i B], 

E 0 * is [Vx -i-iA(x) D -i-iB] D -iVx *n [A(x) Z> -i-iB], and 

E*' is -i{-i [Vx -i-i A(x) & —i B] & -i-i Vx ti [A(xj & —i B]}. 
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For the propositional calculus, there can be no inverse theorem, giving 
a similar reduction of the intuitionistic to the classical system, in which 
the intuitionistic propositional connectives are defined explicitly from 
the classical. For that would give a truth table decision procedure for the- 
intuitionistic propositional calculus, contradicting Godel 1932 . 

Theorem 60. (a) For any formula E, in the propositional calculus, 

predicate calculus, or number-theoretic formal system, b E ~ E° ~ E' ~ 
E°t ~ E 01 ! ~ E*' classically (by *56, *83, *58, *49). 

(bl) For the propositional calculus, if b E classically, then (- E' 
intuitionistically. (b 2 ) For the number-theoretic formal system, if Y |- E 
classically, then T' b E' intuitionistically. (Godel 1932 - 3 .) 

(c) For the number-theoretic formal system, if Y j- E classically, then 
r° b E° intuitionistically. (Gentzen 1936 p. 532 and Bernays.) 

(d) For the propositional calculus, predicate calculus, or number-theoretic 
formal system, if Y b E classically, then r°t b E°t (also r°t b E°t 
and r*' b E*') intuitionistically. 

Proofs, (bl) Using (a), if b E classically, then b E' classically. 
But E' contains as operators only & and - 1 . Thus (bl) follows from 
Corollary Theorem 59. (Conversely, Corollary Theorem 59 is implied by 
(bl).) 

We shall prove (c) after the first lemma, and then infer (b2). 

Lemma 43a. For the number-theoretic formal system, if F contains no 
logical symbols except 3, &, - 1 , V (in particular, if F is E° for some formula 
E), then b tiFdF (and hence b —1— 1 F ~ F) intuitionistically. (After 
Godel 1932 - 3 .) 

Proof of Lemma 43a, by induction on the number of (occurrences of) 
logical symbols in F. 

Basis: F is of the form s=t where s and t are terms. By *158 §40, 
b s=t V —1 s=t, whence by *49c, b —1 —is=t 3 s=t. 

Ind. step. Case 1 : F is A D B. By hyp. ind. : (i) b “i“vB 3 B. By 
*60g, h: (ii) m(A 3 B) b A 3- 1 —iB. From (ii) and (i) by chain 
inference (* 2 ), -n(A 3 B) b A 3 B, and by 3-introd., 
b — 1—1 (A 3 B) 3 (A 3 B). Case 2: F is A & B. By hyp. ind., 
b — 1 — 1 A 3 A and b -|- ’B 3 B. Use *25. Case 3: F is -iA. By 
*49b. Case 4: F is VxA(x). Use hyp. ind., *69 and b lb 3 Ic x from 
Corollary Theorem 17. 
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Proof of Theorem 60 (c). By induction on the length of the given 
classical deduction T (- E, with cases as follows. 

Case 1 : E is one of the particular axioms 14 — 21, or an axiom by 
any schema except 11. Then E° is E, or is an axiom by the same schema, 
or is deducible in the classical propositional calculus (using *56 with 
Theorem 6 § 26) from an axiom by the same schema: so E° is provable 
in the system of the classical propositional calculus with the other axioms 
and axiom schemata added. (For example, if E is an axiom A 3 (B 3 A) 
by Schema la, then E° is A° 3 (B° 3 A 0 ), which is an axiom by the same 
schema. If E is an axiom AdAVB by Schema 5a, then E° is 
A° 3 -i (iA° & -iB 0 ), which is deducible from A 0 3 A 0 V B° by *56 and 
Theorem 6, since the part to be replaced does not stand within the scope 
of a quantifier.) Hence by Theorem 59 (b), h —i—iE° in the intuitionistic 
number-theoretic system; and hence by Lemma 43a, \- E° in the same. 
Case 2: Axiom Schema 11. Then E is A(t) 3 3xA(x), and E° is 
A°(t) 3 -iVx-iA°(x), which is provable intuitionistically by contra¬ 
position (*13) from the axiom Vx-iA°(x) 3 -iA°(t). 

Case 3: Rule 2. We must show that A 0 , (A 3 B)° b B° intu¬ 
itionistically. But (A 3 B)° is A 0 3 B°. Case 4: Rule 9. Similarly. 
Case 5: Rule 12. With the help of *12 and Lemma 43a. 

Proof of Theorem 60 (b2). By Lemma 43a with *58f §27, any part 
of T°, E° of the form A 3 B is equivalent to -i(A&iB). 

Lemma 43b. For the propositional or predicate calculus, if F contains 
no logical symbols except 3, &, -i, V {in particular, if F is E° for some 
formula E), then I- —i —i Ft 3 Ft {and hence b it Ft ~ Ft) intuitionisti¬ 
cally. 

Proof of Lemma 43b. Similarly to Lemma 43a, using in the basis 
*49b instead of *158 and *49c. 

Proof of Theorem 60 (d). For the number-theoretic system and °t, 
from (c) by Lemma 43a. For the propositional or predicate calculus and °t, 
from Lemma 43b in the same manner as (c) from Lemma 43a. (Using 
Lemma 43a or 43b, *58e and *49b, the result can be modified to 
r°t KE°t; using also *58f, to T*' b E*'.) 

Remark 'l. To show that Theorem 59 does not hold for the predicate 
calculus without the exclusion of Rule 9, and that Theorem 60 (b) does 
not hold for the predicate calculus, consider as an example Wx -i~i <07(x) 3 
—\—\'ix^{x). Call this formula “E”. Then b E classically, but intu- 
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itionistically neither b E nor [- - 1 - 1 E (Corollary Theorem 17 §35 and 
Theorem 58 (c)), and moreover not b E', as from E' by * 49 b and *58b 
we can deduce E intuitionistically. The example D FI shows that 

Theorem 60 (c) does not hold for the propositional or predicate calculus; 
and the example -i-i <37 b that Corollary Theorem 59 and Theorem 
60 (bl) do not hold with assumption formulas T. An example to show 
that Theorem 59 does not hold for the number-theoretic system without 
the exclusion of Rule 9 will have to wait until the next section (Theorem 
63 (iii)), as we have yet no method of demonstrating an example of a 
classically provable but intuitionistically unprovable number-theoretic 
formula. 

Corollary 1 (to (c)). For the number-theoretic formal system, if T, E 
contain no logical symbols except D, &, —i, V, and T b E classically, then 
T b E intuitionistically. (to (d) for °t). Likewise for the propositional 
or predicate calculus, provided also that T, E contain no letter unnegated 
other than as antecedent of an implication. 

Corollary 2 (to (b2), (c) or (d)). The classical number-theoretic formal 
system is simply consistent, if the intuitionistic is. 

Proof of Corollary 2. If 1 =0 were provable in the classical 
system, it would also be in the intuitionistic. 

Discussion. Godel remarks, “The theorem [60 (b 2 ), or now (c)] ... 
shows that the intuitionistic arithmetic and number theory is only 
apparently narrower than the classical; in fact [it] includes the entire 
classical [number theory], merely with a somewhat differing inter¬ 
pretation.” Heyting adds, “However for the intuitionists this inter¬ 
pretation is the essential thing.” ( 1934 * p. 18.) 

de Iongh says, “In our significist opinion the most important advantage 
of intuitionistic mathematics is, that it distinguishes in every instance 
between directly and indirectly proved propositions and analyses the 
mathematical concepts into sequences of concepts with different degree 
of indirectness.” ( 1948 , p. 746.) 

van Dantzig 1947 proposes to investigate how much further the de¬ 
velopment of classical mathematics can be carried within the intu¬ 
itionistic, in the manner just shown to be possible for all of the usual 
elementary number theory. For this purpose, the classical formulas E 
are translated into classical equivalents F which are stable intuitionisti¬ 
cally, i.e. such that b ~'-iF ~ F (cf. Lemma 43a). van Dantzig suggests 
that it may be possible to interpret practically the whole of classical 
mathematics within this stable part of the intuitionistic system. 
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For the consistency problem, the present results can be regarded as 
showing that the intuitionistic number theory is equally in need of a 
metamathematical consistency proof with the classical, or, if one accepts 
the consistency of the intuitionistic system on the basis of its inter¬ 
pretation, as securing the consistency of the classical, system. 

Some formalists point out that the methods of intuitionistic elementary 
number theory go beyond what they consider as finitary (cf. Hilbert- 
Bemays 1934 p. 43 and Bernays 1935 ). It is said that the intuitionistic 
use of negations of complicated formulas, and of implications having in 
the antecedent a complicated formula (e.g. a generality formula, or 
another implication) involves the general logical notion of what is an 
intuitionistic proof. It is by such use of negation and implication that 
Brouwer and his followers are enabled to go much further in the de¬ 
velopment of a constructivistic mathematics than Brouwer’s forerunner 
Kronecker. 

The intuitionists do not attempt to give an exact description of their 
notion of a proof in general, and they say that in principle no such de¬ 
scription is possible. 

The intuitionists' use of negation and implication must then be under¬ 
stood as only requiring us to recognize, e.g., that a particular given 
proof is intuitionistically acceptable, or (when they prove a statement of 
the form (A -> B) -*■ C) that if one should produce an intuitionistically 
acceptable deduction of one statement B from another A, then on the 
basis of it one could by a given method surely construct an intuitionisti¬ 
cally acceptable proof of a third C. 

An attempt is made by Bernays 1938 to defend the Gentzen transfinite 
induction up to e 0 (end § 79), as constituting less of an extension of the 
narrower finitary standpoint than the whole body of intuitionistic 
methods in number theory. 

de Iongh 1948 touches briefly upon current discussions regarding 
intuitionism and related trends (particularly signifies, represented by 
Mannoury 1909 , 1925 , 1934 ). 

Now let us examine the way in which Corollary 2 Theorem 60 gives 
a consistency proof for classical elementary number theory from the 
intuitionistic standpoint. The second part of this proof is a tacit or 
explicit verification that the intuitionistic formal system for number 
theory is correct intuitionistically. 

Since the proof of Corollary 2 Theorem 60 is entirely elementary, by 
Godel’s theorem on consistency proofs (Theorem 30 § 42) the second 
part cannot be. 
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It is interesting to note that, just as in the case of Gentzen’s 
consistency proof using transfinite induction up to e 0 , the present con¬ 
sistency proof can be analyzed as depending for its sole non-elementary 
step on the use of a predicate defined by an induction with quantifiers 
of both sorts entering in the induction step, namely (here) the truth 
predicate for number-theoretic formulas. We shall define this predicate 
next. 

Under the usual interpretation of the symbols 0,•, of the variables 
as natural number variables, and of the operations of building terms 
frcm them as corresponding to informal operations of explicit definition, 
any term t(x 1 , ..., x„) containing only the distinct variables x x , ..., x n 
expresses a primitive recursive function t(x v ...,x n ), or for n = 0 
a number t. Under the usual interpretation of =, then every prime 
formula P(x x , . .., x„) containing only x x , ..., x„ expresses a primitive 
recursive predicate P(x v ...,x n ), or for n = 0 a proposition P. For 
any closed prime formula P, the truth or falsity of P is determined (and 
effectively decidable) in our theory of primitive recursive functions, 
so we shall not elaborate upon this part of the truth definition. (Indeed, 
that theory would carry us somewhat further; cf. Example 4 below.) 

(A) From this as basis, we define ‘true’ as applied to any closed 
number-theoretic formula E, by induction on the number of (occurrences 
of) logical symbols in E. In this definition, of course “if” means “if and 
only if”, as is common in definitions. 

1 . A closed prime formula P is true, if P, i.e. if P is a true proposition 
in the theory of recursive functions. 

For Clauses 2 — 5 , A and B are any closed formulas. 

2 . A & B is true, if A is true and B is true. 

3. A V B is true, if A is true or B is true. 

4. A D B is true, if A is true implies B is true (i.e. when A is true 
only if B is true). 

5. -i A is true, if A is not true. 

For Clauses 6 and 7, x is a variable, and A(x) is a formula containing 
only x free. (Then when x is a natural number, x is the corresponding 
numeral, §41.) 

6 . 3xA(x) is true, if, for some natural number x, A(x) is true. 

7. VxA(x) is true, if, for every natural number x, A(x) is true. 
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(B) A number-theoretic formula A(y 1 , ..., y m ) containing free only 
the distinct variables y lf ..y m is true, if, for each m-tuple y lt ..., y m 
of natural numbers, A(y x , ..., y m ) is true. (We need not stipulate here 
that y x , ..., y m all occur free in A(y x , ..., y m ) or the order of occurrence, 
since if a formula is true for any one choice of the list y x , ..., y m , it is 
true for every other.) 

Example 4. Whether a formula E without variables is true or false 
(i.e. not true) can be decided using the 2-valued truth tables; and any 
formula A(x x> ..., x n ) without quantifiers and just x x , ..., x n as variables 
expresses a primitive recursive predicate A(x v ...,*„) such that 
A(x v ..., x n ) s {A(x x , .... x n ) is true}. (Cf. before Theorem 51 § 79.) 
By (A), (C) and (D) § 41: In the number-theoretic formal system every true 
formula without variables is provable, and every formula A(x x , ..., x„) 
without quantifiers numeralwise expresses the predicate A(x lt ..., x n ) 
which it expresses under the interpretation. 

Using this definition, we can establish the following theorem, in much 
the same manner as Theorem 21 §37, which corresponds to it for the 
predicate calculus. 

Theorem 61. (a) N If T f- E in the intuitionistic formal system of number 
theory, and the formulas T are true, then E is true. (b) c Similarly in the 
classical formal system of number theory. 

The only difference in the proofs of Parts (a) and (b) is that for (b) 
we need to use classical methods in the treatment of an axiom by the 
classical Axiom Schema 8. We label Part (a) with " N ” to indicate that, 
although the reasoning is intuitionistic, non-elementary methods are 
used; and Part (b) with “ c ” to indicate that non-intuitionistic classical 
methods are employed (cf. § 37). 

Since A and -iA cannot both be true, Theorem 61 (a) (for T empty) 
implies the simple consistency of the intuitionistic number theory, and 
thence by Corollary 2 Theorem 60 of the classical number theory, as an 
“ x ” result. The gain by Theorem 60 is that we do not have to call the 
latter a " c ” result, as we would in inferring it directly from Theorem 
61 (b). 

Example 5. (a) An V3-prenex formula, if true, is general recursively 

true (§ 79). For example, if Vv3w 0 3w 1 C(v, w 0 , w x ), where C(v, w 0 , w x ) con¬ 
tains no quantifiers and only the distinct variables shown, is true then 
(w){C(i>, w 0 {v), w^v)) is t} when w { {v) = (y.wC(v, ( w) 0 , Hi))., which is 
general recursive using #19 §45 and Theorem III §57. (b) N or c Hence 
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by Theorem 61 (a) or (b): In the number-theoretic formal system, every prov¬ 
able Vl-prenex formula is general recursively true. (Cf. Remark 2 § 79 .) 

Under our Godel numbering of the formulas, the predicate ‘A is true’ 
becomes a number-theoretic predicate T(a), the values of which can be 
given as propositions constructed from primitive recursive predicates 
by the operations of the propositional calculus and quantifiers, the 
number of the latter used being unbounded. We see from Theorem 30 
that this predicate T(a ) cannot be expressed, and its essential properties 
proved, in the system, as then we could formalize the above consistency 
proof in the system. (Cf. Hilbert-Bernays 1939 pp. 329—340.) 

In fact, each of the predicates (ExjT^a, a, x), (x)(Ey)T i (a, a, x, y), 
( Ex)(y)(Ez)T 3 (a , a, x,y, z), ... (cf. Theorem V Part II (b) § 57} is ex¬ 
pressible in the form T(^(a)) with a primitive recursive as can be seen 
by Corollary Theorem I § 49 (by which the formulas given by Corollary 
Theorem 27 to numeralwise express the predicates 7\, T 2 , T 3 , ... express 
them also under the interpretation) with Example 2 § 52. Therefore 
by Theorems VII (d) and V, T(a) is not arithmetical. 

Truth definitions for formal systems were originally investigated by 
Tarski ( 1932 ,-, 1933 ). He established that, if an (effective) formal system 
including the usual number theory is consistent, it must be impossible 
to express the predicate T(a) for the system by a formula T(a) so that 
T(«) - A 0 is provable in the system whenever a is the Godel number of 
a closed formula A 0 . For then the reasoning of the Epimenides paradox 
(§ 11 ) could be carried out in the system. (For more detail, see Hilbert 
and Bernays 1939 pp. 254—269.) 

The notions of truth for formulas intuitionistically and classically 
should differ. The above definition of truth however is phrased alike for 
the two, and any difference in the notions has to be made in our reading 
of the words used in the definition. In § 82, we shall give another truth 
definition, with a theorem for it corresponding to Theorem 61, which 
will apply selectively to the intuitionistic system. The first results, like 
Theorem 61 (a), will be intuitionistic though non-elementary (" N ”), 
but they lead to results which are metamathematical in the narrower 
sense. 

§ 82. Recursive realizability. Our problem is to express the 
interpretation of the intuitionistic number theory in a way which makes 
explicit some feature in which it differs from the classical. 

The meaning of an existential statement “(Ex)A(x)” for the intu- 
itionists has been explained by saying that it constitutes an incomplete 
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communication of a statement giving an x such that A(x) (Hilbert- 
Bernays 1934 p. 32). But “ A(x )” itself may in turn be an incomplete 
communication. Accordingly let us say that “( Ex)A(x )” is an incomplete 
communication, which is completed by giving an x such that A{x) to¬ 
gether with the further information required to complete the com¬ 
munication "A(x)” for that x. 

The idea can be extended to the other logical operations. For example, 
we can regard a generality statement “(x)A(x)” intuitionistically as an 
incomplete communication, which is completed by giving an effective 
general method for finding, to any x, the information which completes 
the communication “A(x)” for that x. 

Similarly, an implication “A -> B” can be regarded as an incomplete 
communication, which is completed by giving an effective general 
method for obtaining the information which completes “B ”, whenever 
that which completes “A” is given. 

Negation can be reduced to implication (cf. Example 3 § 74). 

Now effective general methods are recursive ones, when it is a natural 
number that is being given (§§ 60, 62, 63). Moreover, by the device of 
Godel numbering, information can be given by a number. 

Combining these ideas, we shall define a property of a number- 
theoretic formula which will amount to the formula’s being true under 
the interpretation suggested. However, instead of saying ‘true’, we shall 
say ‘(recursively) realizable’, to distinguish the property defined below 
from ‘truth’ as defined by using direct translations of the formal logical 
symbols by corresponding informal words (end §81). 

The interpretation of a term t(x x , ..., x n ) containing only x x , ..., x n 
free by a primitive recursive function t(x v ..., x n ), or for n = 0 by a 
number t, and the interpretation of a prime formula P(x 1 , ..., x n ) by 
a primitive recursive predicate P{x v ...,*„), or for n = 0 by a prop¬ 
osition P (end §81), do not differ intuitionistically from classically. 
We build upon this in setting up the definition of ‘realizability’ which 
interprets the logical operators intuitionistically as applied to number- 
theoretic formulas. 

First we define the circumstances under which a natural number e 
‘(recursively) realizes’ (or is a ‘realization number’ of) a closed number- 
theoretic formula E, by induction on the number of (occurrences of) 
logical symbols in E. 

(A) 1 . e realizes a closed prime formula P, if e = 0 and P is true 

(in other words, if e = 0 and P). 
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For Clauses 2 — 5, A and B are any closed formulas. 

2. e realizes A & B, if e = 2“-3 6 where a realizes A and b realizes B. 

3. e realizes A V B, if e = 2°-3 a where a realizes A, or e = 2 1 - 3 h where 
b realizes B. 

4. e realizes A D B, if e is the Godel number of a partial recursive 
function 9 of one variable such that, whenever a realizes A, then 9 (a) 
realizes B. 

5. e realizes nA, if e realizes A D 1 = 0 . 

For Clauses 6 and 7, x is a variable, and A(x) a formula containing 
free only x. 

6 . e realizes 3xA(x), He — 2 x -3 a where a realizes A(x). 

7. e realizes VxA(x), if e is the Godel number of a general recursive 
function 9 of one variable such that, for every x, <p(x) realizes A(x). 

Now we define ‘(recursive) realizability’ for any number-theoretic 
formula, thus. 

(B) A formula A containing no free variables is realizable, if there 
exists a number which realizes A. A formula A(y x , ..., y m ) containing 
free only the distinct variables y 1( ..., y m (ra :> 0 ) is realizable, if there 
exists a general recursive function 9 of m variables (called a realization 
function for A(y x , .... y m )) such that, for every y v 9 [y x , ..y,„) 

realizes A (y v ..., y m ). (Using §44, if a given formula is realizable 
for one choice of the y x , .... y m , it is for every other.) 

The handling of the free variables in the present definition of real¬ 
izability differs from that in Kleene 1945 . It simplifies the proof of the 
first theorem (Theorem 62), after which the equivalence of the two 
definitions will follow (by Corollary 1 ). 

The above definition of realizability refers only to our notion of number- 
theoretic formula, i.e. to the formation rules of our formal system. 

A modified notion of realizability, referring to the postulate list of the 
system, and to assumption formulas T if desired, is obtained by altering 
three clauses, as follows. Clause 3: replace “a realizes A” by " a realizes 
A and T h A”, and “b realizes B” by “b realizes B and Y \- B”. 
Clause 4: replace "a realizes A” by “a realizes A and T |- A”. Clause 6 : 
replace “a realizes A(x)” by “a realizes A(x) and Y j- A(x)”. For ‘real¬ 
izes’ [‘realizable’] in this modified sense we say realizes-{Y h) [ realizable - 

(r h)]. 
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Theorem 62 n . (a) If Y b E in the intuitionistic number-theoretic 

formal system, and the formulas Y are realizable, then E is realizable. 
(David Nelson 1947 Part I.) 

(b) Similarly reading “realizable-(T 1 -)” w* place of “realizable”. 

Lemma 44 n . If x is a variable, A(x) is a formula without free variables 
other than x, and t is a term without variables which hence expresses a number 
t, then e realizes A(t) if and only if e realizes A (t). 

Proof of Lemma 44. If A(x) is prime, then whether A(t) is true is 
equivalent to whether A(f) is true. Hence by Clause 1 , the lemma holds 
for a prime A(x). The lemma for any other A(x) follows from this basis 
by induction on the number of logical symbols in A(x), with cases cor¬ 
responding to the other clauses in the definition of ‘realizes’. 

Lemma 45 n . If E is a closed formula, then e realizes E if and only 
if e realizes the result of replacing each part of E of the form - 1 A where A is 
a formula by A D 1 — 0. 

Lemmas 44 and 45 also hold reading ‘T b” or “e realizes-{T b)” in 
place of “e realizes”, when b refers to the intuitionistic number-theoretic 
system, and Y are any formulas. (For Lemma 44 we then use (A) § 41 
with Theorem 24 (b) § 38.) 

Proof of Theorem 62. We state the proof for (a), and (optionally) 
the reader, by taking slight extra care, can verify that the additional 
conditions are met for (b). The proof is by induction on the length of the 
given deduction T b E, with cases corresponding to the postulates of 
our formal system. 

First we consider axioms. If A(y x , ..., y ro ) is an axiom containing as 
its only free variables y x , ..., y m , then by (B) to establish its realiz¬ 
ability we must give a general recursive function <p(y v ..., y m ) such that, 
for every w-tuple of natural numbers y v .... y m , the number 
<p(y v ..., y m ) realizes A(y x , ..., y m ). However, for each of the axiom 
schemata of the propositional calculus, we shall be able to find a number 
which realizes A(y x , .... y m ) for any axiom A(y x , ..., y m ) by the schema. 
It will suffice to give this number (which realizes the closed axioms by the 
schema), because when free variables y 1( ..., y m are present, we can take 
as <p(y x , ..., y m ) the constant function of m variables with this number 
as value (§ 44). Similarly for the particular number-theoretic axioms, 
we shall merely give a number which realizes the result of any sub¬ 
stitution of numerals for the free variables of the axiom. Similarly for 
Axiom Schema 13, we can give a realization number, as a general re- 
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cursive function of x, which depends only on the numeral x substituted 
for x; and for each of Axiom Schemata 10 and 11 one can be given, as 
a general recursive function of x v ..., x n , which depends only on the 
t and on the numerals x 1( ..., x„ substituted for its variables x lf ..., x„. 
Then when the y 1( ,,,,y m include other variables, the <p(y x , ..., y m ) 
can be obtained by expanding that function into a function of the 
required additional variables by use of identity functions (§ 44). 

For each of the axiom schemata and particular axioms (§§ 19, 23), 
we shall express our realization number or function using the notations of 
§ 65. The proof that it is a realization number or function, and the 
necessary verifications of recursiveness, are left to the reader in cases not 
discussed in detail. 

la. In accordance with the preliminary remarks, consider an axiom 
A 3 (B 3 A) by this schema containing no free variables. We show 
that AaAba, i.e. AaAb U\(a, b) (§44), realizes AD(BDA). For let 
a realize A; by Clause 4, we must show that (AaA6 a}(a), i.e. A b a (by 
(71) § 65), realizes B D A. To show this, let b realize B; we must show that 
(Ai a}(b), i.e. a, realizes A. But a does realize A, by hypothesis. 

lb. (A 3 B) 3 ((A 3 (B 3 C)) 3 (A 3 C)) is realized by 
ApAqAa {q(a)}(p(a)). For let p realize A 3 B; we must show that 
{ApAqAa{q(a)}(p(a))}(p), i.e. AqAa{q(a)}(p(a)), realizes (A 3 (B 3 C))3 
(A 3 C). To show this, let q realize A 3 (B 3 C); we must show that 
Aa {q(a)}(p(a)) realizes A 3 C. To show this, let a realize A; we must 
show that {?(«)}(£(a)) realizes C. Now by hypothesis, p realizes A 3 B 
and a realizes A; hence p{a) realizes B. Moreover q realizes A 3 (B 3 C), 
and a realizes A; so q(a) realizes B 3 C. But now q(a) realizes B 3 C, 
and p(a) realizes B; hence {q(a)}{p(a)) realizes C, as was to be shown. 

3. A 3 (B 3 A & B). AaAb 2°-3 6 . 

4a. A & B 3 A. Ac (c) 0 (cf. #19 § 45). 4b. A & B 3 B. Ac (c) v 
5a. A 3 A V B. Aa2°-3 a . 5b. B 3 A V B. A6 2»-3*. 

6. (A 3 C) 3 ((B 3 C) 3 (A V B 3 C)). 

ApAqAr x(p, q, r) where 

X(p, q, r) ~ [p{{r)x) if (r) 0 = 0, q{(r)^) if (r) 0 = 1], using Theorem XX (c). 
Suppose p realizes A D C, q realizes BdC, and r realizes A V B; we 
must show that x(P> <1> r ) realizes C. Case 1 : r — 2°-3 a where a realizes A. 
Then (r) 0 = 0 and (r) 2 = a. Since p realizes A D C and (r) 1 realizes A, 
p((r)i) realizes C. But (r) 0 = 0; so (and because p((r ) x ) is defined) 
X(P> 7- r ) = Pii r ) i). an( i so it realizes C, as was to be shown. Case 2 : 
r — 2 U 3 6 where b realizes B. Similarly. 
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7. (A3B)D((A3-iB)D-iA). Using Lemma 45, the number 
which realizes the closed axioms by Axiom Schema lb (in particular 
those with 1=0 as the C) realizes those by this schema. 

8 1 . -iA D (A D B). 0. For if p realizes -iA, then by Clause 5, p 
realizes A D 1=0. But then no number a can realize A, since p[a) would 
realize the false closed prime formula 1=0, contradicting Clause 1. Thus 
vacuously, if p realizes ->A and a realizes A, then (0(/»)}(a) realizes B. 

(The reader may find it instructive to verify that there is no apparent 
way to treat the classical Axiom Schema 8.) 

10. Let the t for the axiom contain exactly the distinct variables 
x v ..., x„ (n ;> 0); denote it as "tfxj, ..., x n )”, and let t(x v ..., x n ) 
be the primitive recursive function (or for n = 0, the number) which 
it expresses. By the preliminary remarks, we suppose the axiom contains 
free only x 1( ..., x„; if none of x v .. x n is x, let it be VxA(x, x 1( 
..., x„) D A(t(x 1 , ..., x„), x x , ..., x„). Since t(x lf ..., x„) is free for x 
in A(x, Xj, ..., x„), the result of substituting numerals x 1( ..., x n 
for (the free occurrences of) x x , ..., x„ in the axiom is VxA(x, x v .... x n ) 
D A(t(x 1 , ..., x„), x v ..., x„). We shall show that the number 
A p p{t{x 1 , ..., x n )), which as x v ..., x n vary is a general (in fact, primi¬ 
tive) recursive function of x v ..., x n , realizes this formula. By Clause 4, 
for this purpose we must show that, if p realizes VxA(x, x x , ..., x„), 
then P(t(x v ..., x n )) realizes A(t(x x , ..., x B ), x x , ..., x„). But, if p 
realizes VxA(x, x x , ...,x n ), then by Clause 7, p{t(x v ...,*„)) realizes 
A (t, x v ..., x„) where t = t(x v ..., x n ); and hence by Lemma 44, 
p{t(x v ..x n )) also realizes A(t(x x , ..., x„), x v ...,xj. — If say x x is x, 
the axiom is Vx 1 A(x 1 , ..., x„) D A(t(x x , . .x„), x 2 , ..., x„), etc. 

11. A(t(x x , ..., x B ), x x , ..., x B ) D 3xA(x, x v .... x n ). 

A a 

13. A(0) & Vx(A(x) 3 A(x')) D A(x). We treat the case that the 
A(x) contains free only x, as the preliminary remarks will then take care 
of the general case. Let a partial recursive function p(x, a) be defined by a 
primitive recursion thus, 

f p(0, a) = (a) 0 , 
l P(x', a) ~ {{(a) 1 }(x)}(p(x, a)). 

Now we show'that for every x the number A a p(x, a), which is a primitive 
recursive function of x, realizes A(0) & Vx(A(x) D A(x')) D A(x). To 
do so (Clause 4), we prove by induction on x that, if a realizes 
A(0) & Vx(A(x) D A(x')), then p(x, a) realizes A(x). Basis. If a realizes 




§82 


RECURSIVE REALIZABILITY 


507 


A(0) & Vx(A(x) 3 A(x')), then by Clause 2, p(0, a) [= (a) 0 ] realizes A(0). 
Ind. step. Similarly («) x realizes Vx(A{x) 3 A(x')), and hence (Clause 7) 
{(u)j}(x) realizes A(x) 3 A(x'). But by hyp. ind., p(x, a) realizes A(x). 
Hence (Clause 4), p(x', a) [= {{(a) x }(#)}(p(#, «))] realizes A(x'). 

14. After substitution of numerals, we have from this axiom 
a'=b' 3 a=b. This formula is realized by A p 0. For suppose p realizes 
a’=b'. We must show that then 0 realizes a=b. Since a'=b’ is prime, it 
is only realizable if it is true, i.e. if a’ = b’. Then a = b, so a=b is also 
true, and 0 realizes it. 

Similarly, for the other particular axioms, after substituting numerals, 
we have realization numbers as follows. 

15, 18 — 21: 0. 16: ApAq 0. 17: Ap 0. 

Rules of inference. 2 . We take advantage of the remark accompa¬ 
nying the definition of realizability to regard the formulas as each de¬ 
pendent on all of the variables occurring free in any of them. Thus we 
write the rule 

MYv_ • • •, y w ) A(y x , ■ • • > y») => B(y 1 , ..., y m ) 

B(y 1; ...,y»). 

By hypothesis of the induction, the premises A(y x , ..., y m ) and 
A(y x , .... y m ) 3 B(y x , ..., y m ) are realizable, i.e. there are general 
recursive functions a and vp, such that, for every w-tuple of natural 
numbers y x , ..., y m , A(y x , ..., y m ) is realized by the number a(y x , ..., y m ) 
and A(y x , ..., y m ) 3 B(y x , ..., y m ) by the number 4>(y x , ..., y n ). 
Then the number {^(y x , ..., y m )}(a(y x , ..., y m )) realizes B(y x , . ..,y m ). 
Moreover, {<p(y x , .... y m )}(x(y x , ..., y m )) is obviously a partial recursive 
function of y x , ..., y m . But its value is a realization number for every 
y x , .... y m , so it must be defined for every y x , ..., y m ; thus it is general 
recursive. Thus the conclusion B(y x , .... y m ) is realizable. 

9 - C(y x , ...,y m ) 3 A(x, y x , ...,y m ) 

C(y x , • • •, y m ) ^ VxA(x, y x , ..., y m ). 

By the hypothesis of the induction and the definition of realizability, 
there is a general recursive function such that, for every x, y x , ..., y m , 
yi> • • •, y«) realizes C(y x , ...,y m ) A(x, y v .... y m ). We shall 
prove that, for every y x , ...,y m , AcAx{^{x, y x , ..., y m )}(c) realizes 
C(y x , ..., y m ) 3 VxA(x, y x , ..., y m ). This will give the realizability of 
the conclusion, since Ac Ax {^(x, y x , .. .,y- m )}(c) is a primitive recursive, a 
fortiori general recursive, function of y v ..., y m . Accordingly suppose that 
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c realizes C(y v ...,y m ); we must show that Ax {^(x, y v ..., y m )}(c) 
realizes VxA(x, y v ..., y m ). To do this, we must show that, for every x, 
{ty(x, y„ ...,y m )}(c) realizes A(x,y v ...,y m ). But since c realizes 
C and by hyp. ind. ty(x, y v ..., y m ) realizes C(y v .... y m ) 3 

A (x,y lt ...,y m ), {i{x,y lt ...,y m )}(c) does realize A (x,y lt ...,y m ). 
(Note how this treatment would break down, if the C contained x free, 
call it “C(x, y 1( ..., y m )”. Then, we would have to assume that c realizes 
C(x, y v ..., y m ) for some x, and we could conclude only that 
Vi* • •y m )}(c) realizes A(x, y v ...,y m ) for that x, whereas we 
would need to conclude it for every x.) 

12. A(x, y v . ,.,y m ) 3 C(y x> ...,y OT ) 

3xA(x, y x , ..., y m ) 3 C(y x , .... y m ). 

Similarly, using A p {<j;((/>) 0 , y v ..y m )}({p)i) as realization function for 
the conclusion, given that is for the premise. 

The theorem includes the simple consistency of the intuitionistic 
formal system of number theory (by using (a) with T empty and 1 = 0 
as the E), as does Theorem 61 (a). The additional interest in Theorem 
62 in this connection stems from the different condition on new axioms 
T under which it is shown that the simple consistency is preserved (as 
we shall discuss further following Theorem 63). 

Corollary 1 n . If y v .... y m are distinct variables, and A(y x , .... y m ) 
is a formula, then A(y x , ..., y m ) is realizable, if and only if 
Vy x ... Vy m A(y x , ..., y m ) is realizable. 

For A(y x , ..., y m ) and Vy x ... Vy m A(y 1 , ..., y m ) are interdeducible 
in the intuitionistic formal system. 

This corollary (applied to the case y 1( ..., y m are the free variables 
of the given formula in order of first free occurrence) gives the equivalence 
of the present version of the definition of realizability (Kleene 1948 ) 
to that of Kleene 1945 . 

Corollary 2 n . (a) If F are realizable formulas, A(x x , ..., x n , y) is a 

formula containing free only the distinct variables x x , ..., x n , y, and 
r h 3yA(x x , ..., x n , y) in the intuitionistic number-theoretic formal 
system, 'then there is a general recursive function y — <p(x 1 , ..., x n ) such 
that, for every x v .... x n , A(x v ...,x n ,y) (where y = <p(x lt .... *„)) 
is realizable. 

(b) Similarly reading in place of “realizable” any one of the following 
combinations of properties: (i) “realizable-(F )-) and deducible from T”, 
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(ii) "realizable-(T (-), deducible from T, and true ”, (iii) "realizable-(Y b)> 
deducible from T, and realizable”, (iv) "realizable-^T b), deducible from T, 
true and realizable”. 

Proofs, (a) By (a) of the theorem with (B) and (A) 6 of the definitions, 
(b) (i) Using instead (b) of the theorem, (ii) Using further Theorem 61 (a) 
to infer that A(x 1( ..., x n , y) is true, (iii) Using further (a) of the theorem 
to infer that A(x x , .... x n , y) is realizable. 

Realizability is intended as an intuitionistic interpretation of a formula; 
and to say intuitionistically that A(Xj, ..., x n , y) is realizable should 
imply its being intuitionistically true, i.e. that the proposition 
A (x v ..., x„, y) constituting its intuitionistic meaning holds. The formula 
3yA(Xj, ..., x n , y) asserts the existence, for every x lt ..., x n , of a y 
depending on x lt ..., x n , such that A (x v ..., x n , y ); or in other words, 
the existence of a function y = <p(x v ...,*„) such that, for every 
x v ...,x n , A(x v y(x v ...,x n )). By (a) of the corollary for T 

empty, that formula can be proved in the intuitionistic formal system, 
only when there exists such a 9 which is general recursive. In brief, 
only number-theoretic functions which are general recursive can be 
proved to exist intuitionistically. (We are here considering the assertion 
of the existence of a function value y{x v ...,x n ) for all n -tuples 
x v ...,%„ of arguments, so this is not in conflict with our use intu¬ 
itionistically of partial recursive functions.) 

This result as inferred from (a) depends on accepting the thesis that 
the realizability of A(x x , ..., x n , y) implies its truth. However by 
using (b) for T empty (in which case, since we have no hypothesis on 
T to satisfy, we may take the strongest form (iv) in the conclusion, i.e. 
that A(Xj, ..., x„, y) is realizable-(b), provable, true and realizable), 
we obtain the same result independently of that thesis. 

The presence of the T in the corollary shows that the result will hold 
good upon enlarging the formal system by any suitable axioms I\ If 
the thesis that realizability implies truth, intuitionistically, is accepted, 
these need only be realizable. Otherwise they should be realizable-(r b) 
and true (deducibility from P holds automatically in the hypothesis on T). 

The result provides a connection between Brouwer’s logic as formalized 
by Heyting and Church’s thesis (§ 62) that only general recursive functions 
are effectively calculable. Both developments arose from a constructivistic 
standpoint, but were previously unrelated in their details. 

The formula 3yA(x 1( ...,x„, y) does not assert the uniqueness of 
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the function y = <p(x v ..x n ) such that A(x v ..x n , 9 (* x , .. x n )); 
for this we need 3!yA(x x , ...,x n , y) (§41). 

Classically, given the existence of some function 9 such that, for all 
x v ...,x n , A(x v ...,x n , <p(x lt ..., x n )), the least number principle 
provides formally a method of describing a particular one (*149 § 40, 
*174b § 41). While we do not have the least number principle intuition- 
istically, we do know by Corollary 2 that, whenever a particular intu- 
itionistic proof of a formula of the form 3yA(x x , ..., x„, y) is given, we 
can on the basis of that proof describe informally a particular general 
recursive function 9 ^, ..., x n ) such that, for all x v ..x n , A(x v ...,x n , 

?(*i> •••.*«))• 

Example 1 . (Cf. Example 8 (c) §74.) Let S x be the intuitionistic 
number-theoretic system. Let A(x, y) be a formula containing free only 
x and y. Suppose that for each x, the formula A(x, y) is true for exactly 
one y. Then when (to obtain S 2 ) we introduce f with the axiom A(x, f(x)), 
the axiom characterizes f as expressing a certain function 9 under the 
interpretation. By 3-introd. from the new axiom, p 2 3yA(x, y). Now 
suppose f with the axiom A(x, f(x)) is eliminable. Then |- x 3yA(x, y). 
Then by Theorem 62 Corollary 2 (b) (ii) with T empty, there is a general 
recursive function y = 9 x (x) such that, for each x, A(x, y) is true. But then 
9 X = 9 . Thus in the intuitionistic number-theoretic system, a new function 
symbol f ( expressing a function 9 ) introduced with an axiom of the form 
A(x, f(x)), where A(x, y) contains free only x and y, and A(x, y) is true 
exactly when y = <?{x), is eliminable only when 9 is general recursive. 

Example 2. Let A(x, y) be any formula, containing free only x and y, 
such that hi 3yA(x, y). Then as in Example 1 , there is a general re¬ 
cursive function y — 9 x (x) such that, for each x, A(x, y) is true. The 
demonstration of this (consisting mainly in the proof of Theorem 62 (b)) 
is constructive; given a proof of 3yA(x, y) (or the Godel number of such 
a proof), we can find a system E of equations defining a 9 X recursively 
(or a Godel number of 9 X ). Also it is effectively decidable whether a 
number a is the Godel number of a proof of a formula of the form 
3yA(x, y) where A contains free only x and y (Case 1), or not (Case 2). Let 



a Godel number of 9 X , in Case 1, 

Ax x (i.e. a Godel number of 17}), in Case 2, 


where ambiguity as to which Godel number of which 9 X (or which Godel 
number of 17}) is chosen is removed by suitable conventions. Then 6(a) 
is effectively calculable. So by Church’s thesis we may expect that 0(a) 
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is general recursive. (In fact, it is easy to prove that 0(a) is primitive 
recursive, after establishing: (1) There is a primitive recursive function 
£(a) such that, if a is the Godel number of a proof in the intuitionistic number- 
theoretic system, then 1(a) is a Godel number of a realization-(\~) function 
.. - ,y m ) for the endformula A(y 1( ..., y m ), where y lf . .., y m are the 
free variables of the endformula in order of occurrence in our list of the 
variables.) Let y(x) — {0(*)}(%) +1. Then y(x) is general recursive. Now 
let A(x, y) be a formula such that A(x, y) is true exactly when y — <p(x) 
(e.g. one which numeralwise represents cp, cf. Theorem 32 (a) § 59). If 
we now take this formula as the A(x, y) of Example 1, we are led to a 
contradiction by supposing that f with the axiom A(x, f(x)) is eliminable. 
Thus: (2) There is a general recursive function <p such that, in the intu¬ 
itionistic number-theoretic system, a new function symbol f expressing <p with 
an axiom of the form A(x, f(x)), where A(x, y) contains free only x and y, 
and A(x, y) is true exactly when y = y(x), is not eliminable (and HyA(x, y) 
is not provable for any such A(x, y)). 

Theorem 63 n . For suitably chosen formulas A(x), B(x) and C(x, y). 
the following classically provable formulas are unrealizable and hence 
(by Theorem 62 (a)) improvable in the intuitionistic formal system of 
number theory. (Specifically, let A(x, z) numeralwise express the predicate 
Tp(x, x, z) of § 57, using Corollary Theorem 27 § 49. Let A(x) be 3zA(x, z), 
B(x) be A(x) V -iA(x) and C(x, y) be y=l V (A(x) & y= 0 ).) 

(i) A(x)V-iA(x). 

(ii) Vx(A(x) V-i A(x)) (the closure of (i)). 

(iii) -i-iVx(A(x) V-iA(x)) (the double negation of (ii)). 

(iv) Vx -v-iB(x) D -i-iVxB(x). 

(v) -ri{Vx -i-iB(x) ID -i—iVxB(x)} (the double negation of (iv)). 

(vi) 3yC(x, y) ID 3y[C(x, y) & Vz(z<y D -iC(x, z))] (cf. *149 § 40). 

(vii) 3 y[y<w & C(x, y) & Vz(z<y ID -iC(x, z))] V Vy[y<w D ~<C(x, y)] 
(cf. *148). 

Also the closure, and the double negation of the closure, of (vi) and of (vii). 
((i) — (v): Kleene 1945 with Nelson 1947 .) 

Lemma 46 n . (a) If A is realizable, and B is unrealizable, then A ID B 

is unrealizable. Hence: If A is realizable, then -iA is unrealizable, (b) If 
A is closed and unrealizable, then A ID B and (hence) - 1 A are realizable, and 
(by (a)) tiA is unrealizable. 
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Proof of Lemma 46. (a) By Theorem 62 (a) or the case for Rule 2 

in its proof, if A and A 3 B are realizable, so is B. (b) For a closed B, 
any number, e.g. 0, realizes A 3 B, since vacuously, whenever a realizes 
A (i.e. never), 0(a) realizes B. 

Lemma 47 n . If P(x x , ...,x„) numeralwise expresses a general re¬ 
cursive predicate P(x v ..x n ) in the intuitionistic formal system of number 
theory, then, for every x v ..., x„, P(x x , ..., X„) is realizable if and only if 
P{%1> • • • > %n) • 

Proof of Lemma 47. If P(x v ..., x n ), then by § 41 (i), |- P(x x , ..., x n ), 
and hence by Theorem 62 (a), P(x 1; ...,x n ) is realizable. Conversely, 
suppose P(x x , ..., x n ) is realizable. Because P(x v .. x n ) is a general 
recursive predicate, we have (constructively) that, for the given 
x v ..x n , either P{x 1 , ..x n ) or P(x v ...,x n ). In the latter case, 
however, by §41 (ii), b “iP(x x , . ..,x„), and hence by Theorem 62 
(a), “iP(x 1 , ...,x n ) is realizable, which by Lemma 46 (a) contradicts 
our supposition that P(x x , ..., x n ) is realizable. 

Proof of Theorem 63. (i) Suppose (i), i.e. 3zA(x, z) V-j3zA(x, z), 

were realizable. Let y{x) be a realization function for it; and set 
p{x) = (<p(x)) 0 . Then p(x) is general recursive, and takes only the values 
0 and 1 (by (B) and (A) 3 of the definitions). Consider any fixed x. Case 1 : 
p(x) = 0. Then (<p(x)) x realizes 3zA(x, z); and hence (<p(x)) x x realizes A(x, z) 
where z = (<p(x)) x 0 , in which case by Lemma 47, T x (x, x, z ). Thus 
{Ez)T x {x, x, z). Case 2: p(x) = 1. Then (<p(x)) x realizes -i3zA(x, z), i.e. 
(q>(x)) x realizes 3zA(x, z) 3 1=0. We shall show that then (Ez)T x {x, x, z). 
For if there were a z such that T x (x, x, z), by Lemma 47 A(x, z) would be 
realizable; say k realizes it. Then 2*-3 fc would realize 3zA(x, z); and 
{(<p(x)) x }(2*-3*) would realize 1=0, which is impossible. The two cases 
show that the general recursive function p(x) is the representing function 
of (Ez)T x (x, x, z). But (Ez)T 1 (x, x, z) is non-recursive ((15) Theorem 
V § 57); hence no such general recursive p(x) can exist. By reductio ad 
absurdum, therefore (i) is unrealizable. 

(ii), (iii). By V-elim. (i) is deducible intuitionistically from (ii), so 
by Theorem 62 (a) also (ii) is unrealizable; and by Lemma 46 (b) so is 

(iii), since (ii) is closed. 

(iv) Since (iii) can be deduced from (iv), using *5la § 27 and V-introd. 

(vi) We show as follows that (i) is deducible from (vi). From 1 = 1 
(which is provable) by V- and 3-introd., 3yC(x, y). Using this, from (vi) by 
3-elim., 3y[C(x, y) & Vz(z<y 3-iC(x, z))]. Preparatory to &- and 



§82 


RECURSIVE REALIZABILITY 


513 


3-elim., assume C(x, y), i.e. 

(1) y=l V (A(x) &y=0) 

and Vz(z<y D-iC(x,z)), i.e. 

(2) Vz(z<y D-i{z=lV (A(x) & z=0)}). 

We use proof by cases from (1) to deduce (i) with the help of (2). 
Case 1 : assume y=l. For reductio ad absurdum, assume further 
A(x). From this and 0=0, by &- and V-introd., 0=1 V (A(x) &0=0). 
But also from y=l by ♦l35b, 0<y; and thence from (2) by V-elim. 

(with 0 as the t) and D-elim., —i{0= 1 V (A(x) & 0=0)}. Hence by re¬ 

ductio ad absurdum, -iA(x). By V-introd., A(x) V -iA(x), which is (i) and 
does not contain free the variable y of our proposed 3-elim. Case 2 : assume 
A(x) & y=0. By &-elim. and V-introd., A(x)V-iA(x). (This deduction 
is related to the intuitive reasoning of Example 6 § 64.) 

(vii) From (vii) we can deduce (vi), as in the proof of *149 from *148. 

Theorem 63 (i) — (v) imply that G? V -i G? is unprovable in the intu- 
itionistic propositional calculus, and 'ix{^T(x) V-i<G?(x)), 

—i—y^x{TT{x) V-i<C?(x)), Vx-i-ic3(*) D-i-iVxd?(x) and 
—i— i{Vjc -!-i <^(*) D -i-iVx<G?(x)} in the intuitionistic predicate calculus, 
as we already knew from Theorem 57 (b) and Theorem 58 (a) and (c). 
The present proofs are less elementary than those based on Gentzen’s 
normal form theorem, but contribute insight into the working of the 
intuitionistic logic as an instrument for number-theoretic reasoning. 
We succeed in showing AV-iA unprovable in intuitionistic number 
theory only in the presence of a free variable x. 

Corollary (to (ii)) N . The formula- iVx(A(x) V-iA(x)) ( although the 
negation of a classically provable formula ) is realizable. 

By (ii) and Lemma 46 (b). 

The formula Vx(A(x) V -iA(x)) is classically provable, and hence under 
classical interpretations true. But it is unrealizable. So if realizability 
is accepted as a necessary condition for intuitionistic truth, it is untrue 
intuitionistically, and therefore unprovable not only in the present 
intuitionistic formal system, but by any intuitionistic methods whatsoever. 

This incidentally implies that our classical formal system reinforced 
by an intuitionistic proof of simple consistency cannot serve as an in¬ 
strument of intuitionistic proof, as suggested in § 14, except of formulas 
belonging to a very restricted class (including those of the forms B(x) 
and VxB(x) end §42, but not the present formula Vx(A(x) V-tA(x))). 
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The negation iVx(A(x) V -iA(x)) of that formula is classically untrue, 
but (by the corollary) realizable, and hence intuitionistically true, if we 
accept realizability (intuitionistically established) as sufficient for intu- 
itionistic truth. 

So the possibility appears of asserting the formula -nVx(A(x) V -iA(x)) 
intuitionistically. Thus we should obtain an extension of the intuitionistic 
number theory, which has previously been treated as a subsystem of the 
classical, so that the intuitionistic and classical number theories diverge, 
with-iVx(A(x) V -i A(x)) holding in the intuitionistic and Vx(A(x) V-iA(x)) 
in the classical. 

Such divergences are familiar to mathematicians from the example of 
Euclidean and non-Euclidean geometries, and other examples, but are 
a new phenomenon in arithmetic. The first example comes by adjoining 
A P (p) or -i A P (p) to the number-theoretic formalism, cf. end §§42 and 75. 

Not only is the formula -i Vx(A(x) V -iA(x)) itself realizable, but by 
Theorem 62 (a) (taking it as the T), when we add it to the present intu¬ 
itionistic formal system, only realizable formulas become provable in the 
enlarged system. So then every provable formula will be true under the 
realizability interpretation. In particular, the strengthened intuitionistic 
system is thus shown by interpretation to be simply consistent. 

A fuller discussion is given in Kleene 1945 , where the proposed ad¬ 
junctions to the unstrengthened intuitionistic formal system of number 
theory S, to obtain a strengthened intuitionistic system S' diverging 
from the classical S c , are in the form of an identification of truth with 
realizability. 

Refinements of the results which we are basing here on interpretation 
are obtained by Nelson 1947 Parts II—IV (with Kleene 1945 ). Because 
they all involve the consistency of the number-theoretic formalism, no 
completely elementary treatment can be expected. But the non-elemen- 
tariness is minimized in the results based on this further work of Nelson 
to the full extent that the results are proved in elementary metamathe¬ 
matics under the hypothesis of the simple consistency of S. In particular, 
by these results with those of Godel 1932-3 (cf. Corollary 2 Theorem 60), 
it is demonstrated metamathematically that both S' and S c are simply 
consistent if S is. (Nelson takes as his 5 not our intuitionistic formal 
system but one obtained, apart from an inessential difference in the 
equality postulates, by adjoining to ours some additional function 
symbols with their defining equations. These equations fit our schemata 
(I) — (V) § 43 or closely similar schemata, except that also a certain 
schema of course-of-values recursion is allowed. Using Nelson’s (i) — (iv) 
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p. 332, to each application of that schema a pair of equations having the 
same form with f, g, h, t* replaced by f \ g', h\ t- is provable without the 
application; so the course-of-values recursion schema is eliminable. Then 
by Example 9 § 74 with the remarks preceding it, the additional function 
symbols are eliminable.) 

Nelson 1949 introduces a notion of ‘P-realizability’, using which one 
can set up a number-theoretic system diverging from both the strength¬ 
ened intuitionistic and the classical. 

Gene Rose 1952 investigates realizability in relation to the intu¬ 
itionistic propositional calculus. 

Kleene 1950 a plans the use of recursive functions in interpreting 
intuitionistic set theory. 

Example 3. (a) The operators 3, - 1 , &, V applied to closed formulas 

A and B obey the strong 3-valued truth tables (§ 64, restated with the present 
symbols), when t, f, u are read as ’realizable’, ‘unrealizable’, ‘unknown (or 
value immaterial)’, respectively, i.e. the tables then give only correct 
information about the realizability or unrealizability of A 3 B, -iA, 
A & B, A V B, when entered from such information about A and B. Proof. 

A 

Consider 3. If B is realizable, then by *11 § 26 with Theorem 62 (a), 
so is A 3 B, corresponding to the three t’s in Column 1 of the table for 3. 
If A is unrealizable, then by Lemma 46 (b), A 3B is realizable, corre¬ 
sponding to the three t’s in Row 2. If A is realizable and B is unrealizable, 
then by Lemma 46 (a), A 3 B is unrealizable, corresponding to the f in 
Row 1 Column 2. The table for -1 is simply the f column of that for 3; 
and & and V are easily treated, (b) A formula without variables is realizable, 
if and only if it is true. Its realizability (and truth) or unrealizability 
(and falsity) is thus effectively decidable by the valuation procedure 
furnished by the usual interpretation of 0, ', +, •, = and the classical 
2-valued truth tables for 3, - 1 , Sc, V (cf. § 79 before Theorem 51). 
Proof by using Example 4 § 81, or thus: For closed prime formulas, truth 
and realizability agree, and can be decided. In building thence composite 
formulas by the operations of the propositional calculus, we always 
remain within the first two rows and columns of the 3-valued tables, 
(c) We call a number e an R -valuation number of a closed formula E, if 
either e = 2°-3 €l (then e x = (e) x ) and e x realizes E, or e = 2 1 -3° and E is 
unrealizable. For open formulas, R -valuation function is defined in analogy 
to 'realization function’. A formula C(z 1 , ...,z m ) containing no quanti¬ 
fiers and only the distinct variables z v .... z m (m > 0) has a primitive 
recursive R-valuation function y (z v .. .,z m ). Proof (omitting “z v .... z m ” 
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to save space). Case 1: C is a prime formula P. Then P expresses a prim¬ 
itive recursive predicate P, with representing function <p. Let y = 2 < P*3°. 
Case 2: C is A D B, where by hyp. ind. there are primitive recursive 
R-valuation functions a and p for A and B, respectively. Let 


T = 


2°‘[3 exp Aa (p)J if (p) 0 = 0, 
2°‘3° if (a)„=((3)»=l, 

2 1 *3° otherwise. 


Case 3: -iA. Take p = 2 1 -3° in Case 2. Case 4: A & B. Let 
J 2°- [3 exp 2<«>i-3<^l if («) 0 =(p) 0 = 0, 

T \ 2 1 -3° otherwise. 

Case 5: A V B. Similarly, (d) A prenex formula is realizable, if and only 
if it is general recursively true. (Cf. Remark 2 § 79 and Example 5 § 81.) 
Proof. Consider again the formula G used as illustration in § 79. Let 
a(y v x v y t , x t , y 3 ) be a primitive recursive R-valuation function for 
A(y 1( x v y 2 , x 2 , y 3 ). Then if G is recursively true, it is realized by 

2*'»-[3 exp 2 Vt(Xl) ‘ [3 exp Ax 2 2 V3(I1 -* 2) -3 exp 
(“Oh. Xv x s , y 3 (x v * a )))J]. 

Conversely, if g realizes G, then G is recursively true with y x = (g) 0 , 

Vtix i) = ({(g)iX*i))o ^ yz( x v x 2 ) = ({({(g)iX«i))xX^2))o as the required 
number and general recursive functions. 

Example 4. Under the thesis that ( x)(Ei)B(x , i) holds intuition- 
istically only if there is a general recursive a such that (x)B(x, x(x)) (cf. 
above, or Kleene 1943 p. 69 Thesis III), we show that 

(a) (y)(Ei) i<a A(i, (y)<) -> {Ei) i<a {y)A(i, y) 

does not hold intuitionistically for all A (cf. (20) § 57). Using (51) § 61, 

(x)(y)m t<9 W t {x, (y),). 

Thence if we had (a) intuitionistically, we would get (x)(Ei) i <2 iy)W t {x, y) 
intuitionistically; whence by the thesis, for some recursive a taking 
values <2, {x){y)W aix) (x,y); whence (x){Ey)W^ (x) {x, y); whence for 
each x, 

(b) (Ey)W 3 {x, y) -> a(x)= 1, {Ey)W 1 (x, y) -> a(x)=0. 

But we cari take a(*)= 1 as the R 0 {x, y) and a(^)=0 as the R x (x, y) of 
(57) and (58) §61. Then either a(/) = 1 or a(/) = 0. If a(/) = 1 ( then 
(Ey)R 0 (f , y), whence it follows as in § 61 that ( Ey)W x (f , y), contradicting 
(b). Similarly, if a(/) = 0. 
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63, 69, 98, 107, 175, 176, 178, 179, 204, 
211, 212, 225, 245, 246, 272, 321, 389, 
394, 397, 403, 407, 415, 416, 424, 435, 
436, 463, 472, 475, 476, 478, 498, 501. 
Bernstein, F. 11. 

Berry’s paradox 39. 

Black, M. 46. 

blank square (Turing machine) 357. 
Bolyai, J. 28. 

Boole, G. 61. 

Boone, W. W. 386, 536. 

bound, upper etc. 31; — term 410; 

— variable 76, 153; cf. bounded, free, 
bounded quantifiers 197, 202, 225, 228. 

285, 329, 336, 471, 516. 
branch 107; principal 267. 

Brouwer, L. E. J. 43, 46, 53, 56, 57, 58, 
318, 491, 498, 509 
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Burali-FortEs paradox 36, 40, 45, 532. 
Bussey, W. H. 54. 

calculable: cf. effectively, reckonable. 
calculation: problem 137, 314, 316, cf. 
Church's thesis; procedure cf. algo¬ 
rithm. 

canonical system (Post) 320. 

Cantor, G. 3, 6, 9, 15, 16, 30, 36, 40, 46, 
48, 477; — 's diagonal method 6 , 14, 
205, 207, 272, 282, 320, 341, 382; — *s 
theorem 15, 425; — 's paradox 36, 
37, 40, 42, 45, 532. 
cardinal number 3, 9, 44. 

Carnap, R. 45, 63, 83, 250. 

cases, definition by 229; induction 186; 

proof by 98, 100, 182, 190. 
categorical axioms 27, 430, 432. 

Cauchy, A. L. 30. 

Cayley, A. 54. 

Cervantes 40. 

chain: of equivalences etc. 117; in¬ 
ference 113, 114. 

change of variables 105, 153, 161. 
choice, axiom of (Zermelo) 52, 394. 
Church, A. 34, 98, 180, 241, 279, 300, 
301, 319, 320, 322, 324, 325, 382, 432, 
434, 436, 517; — 's theorem 301, 315, 
382; — s thesis 300, 313, 314, 316, 

317, 331, 352, 354, 376, 381, 382, 509, 

converse of 300, 319. 

class cf. set. 

classical vs. intuitionistic (logic, mathe¬ 
matics, methods) 46, 48, 51, 53, 56, 63, 

318, (formal logic, formal systems) 
101, 140, 442, 480, 492, 501, cf. in¬ 
tuitionistic. 

closed formula 151. 
closure 151. 
cognate sequents 480. 
combinatory definability 321. 
commutative laws 118, 186. 
comparability theorem 16. 
compatible formal systems 437. 
complement of a set 10. 
complete: equality, equivalence 328, 
331, 335, 340; formal system cf. 
completeness; — set (Post) 343. 


completed infinity cf. actual, 
completely defined function etc. 326. 
completeness (concepts) 131, 135, 176, 
194, 212, 266, 302, 304, 309, 326, 422, 
(results) 132, 134, 135, 139, 204, 295, 
304, 389, 393, 395, 397, 400, 422, 424, 
474, cf. incompleteness, 
computable function 320, 321, 360, 362; 

1/1, w/s 361; cf. Turing, 
concatenation 71. 

conclusion (deduction) 87, 107, (im¬ 
plication) 113, 138, (inference) 83. 
condition (Turing machine) 357. 
conditional: equation 149; interpreta¬ 
tion 150. 

configuration (Turing machine) 357. 
congruent formulas 153. 
conjunction (interpretation) 51, 69, 138, 
503, (formal logic) 69, 113, 118, 122, 
134, 166, 177, cf. introduction, prop¬ 
ositional connectives, truth tables, 
conjunctive normal form 135. 
connexity 188. 

consequence 82, cf. deducibility, 
consistency (concepts) 53, 124, 131, 207, 
212, 266, 297, 302, 304, 305, 309, 314, 
326, 391, 422, 440, 475, 479, 498, (re¬ 
sults) 128, 129, 172, 174, 176, 204, 
304, 422, 425, 466, 470, 472, 474, 497, 
500, 508, 514, cf. Godel’s second 
theorem; — in a system 391; — 
theorem 463, 466, 469, 475. 
consistent interpretability 437. 
constant 150; — function 220, 223; 
logical, non-logical 436; variable held 
96, 102, 140, 148, 179. 
constructive: continuum 43; existence 
proofs 49, 52, 501, 509; method 26, 
53; infinity cf. potential; ordinals 
325; cf. intuitionistic. 
continuous functions 18, 164. 
continuum 17, 30; constructive 43; — 
problem 16, 41 (Godel); cf. analysis, 
contraction (quantifiers) 285, (Gentzen) 

443, 444. 

contradiction 119, cf. consistency, 
contraposition 113. 
contributory deduction 267. 
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convergence 32, 164. 
correctness cf. consistency, 
correspondence 3, 9, 32, 255, 426. 
countable set 3, cf. enumerable, 
course-of-values: function 231, 291; in¬ 
duction 22,193; recursion 231,237,286. 
creative set (Post) 346. 
crocodile, dilemma of the 39. 

Curry, H. B. 154, 321, 459, 460, 492. 
cut (Gentzen) 443, 453; cf. Dedekind. 

Dantzig. D. van 38, 497. 

Davis, M. 315. 

decidable; closed formula 194, 201; cf. 
effectively, numeralwise, resolvable, 

un—. 

decimal notation 6, 19, 30, 381. 
decision problem (concepts) 136, 137, 
301, 313, 314, 316, 382, 435, 437, 
(results) 137, 204, 301, 313, 314, 383, 
386, 407, 423, 432, 438, 439, 474, 
482; special cases of the 435; cf. 
Church's thesis, degree, reducibility. 
decision procedure cf. algorithm. 
Dedekind, R. 14, 30, 43, 46, 48, 242; 

— cuts 30, 36, 43, 52. 
deducibility 87, 88, 264, cf. deduction, 
deduction (formal) 87, 94, 106, 249, 264, 
278, 290; given, resulting 94; — 
schema 87; — theorem 90, 97, 98, 
140, 179; cf. variation, 
deductive: method 59, 60; rules 80. 
defined: notions 28; value 326. 
degree of unsolvability 315, 316, 343, 

386, 435. 

Dekker, J. C. E. 527. 

De Morgan, A. 61, 119. 
denumerable set 3, cf. enumerable, 
dependence (deduction) 95, 102, (func¬ 
tion) 220, 224, 354. 
depth 115, 151, 185. 
derived rules 86, 407, cf. introduction. 
Descartes, R. 17, 54. 
descending induction 193, 478. 
description 199; eliminability of 407; 

indefinite 346. 

designation 71, 250, 251. 
diagonal method cf. Cantor. 


dictionary (semi-group) 383. 
difference (numbers) 223, (sets) 10. 

Diophantus 48. 

direct: clause 20, 259; rule 86, 94. 
discharge 94, 181. 
disjoint sets 10. 

disjunction (interpretation) 50, 69, 138, 
225, 503, (formal logic) 69, 113, 118, 
122, 134, 166, 177, 190, cf. intro¬ 
duction, propositional connectives, 
truth tables. 

disjunctive normal form 134, 136. 
distributive laws 118, 186. 
divisibility 191, 202, 230, 241. 

Dixon, A. C. 38. 

domain 9, 32; (system, valuation) 24, 
28, 168, 172, 174, 389, 400, 427, 463. 
double: basis 193, 232; negation 99, 
119, cf. intuitionistic formal systems; 

recursion 273. 
dual: fraction etc. 16, 30, 361, 381; 

-converse 124, 167; cf. duality, 
duality (logic) 122, 123, 167, 285, 443, 
(projective geometry) 56. 

effective interpretation 464. 
effectively: calculable function, decid¬ 
able predicate 299, 300, cf. Church's 
thesis, decision problem; enumerable 
set of formulas 398; true formula 465. 
Einstein, A. 57. 
element of a set 9. 

elementary: axiom system 421, 424; 
axioms 421; conjunction etc. 134; 
function (Kalmar) 285; number theory 
30, 51, 69; predicate 284, 291, (Kal¬ 
mar) 285. 

eliminability 405, 419, 510, 515. 
elimination: laws 118; of logical sym¬ 
bols 98, cf. introduction; relations 
405, cf. eliminability; theorem 453, 
460, cf. Gentzen. 
empty: conjunction etc. 441; expression 
70, 78, 382; range 226; set 9. 
endformula etc. 87, 107, 264. 
entity 247, 251, 259, 276. 
enumerable set 3, 427, cf. enumeration, 

recursively. 
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enumeration 4, 426; effective 398; — 
theorem 281, 341. 
Epimenides' paradox 39, 42, 45, 205, 501. 
equality (notions) 9, 13, 20, 127, 172, 
247, 328, (formal logic) 183, 197, 399, 
409, 418, 424, (recursiveness) 227, 
329; axioms for 399, 403, 424; — 
and predicate letter formula 399. 
equation 264, 277; system of —s 264, 

276, 277. 

equivalence (notions) 9, 113, 225, 320, 
328, 334, 383, 386, 401, 406, (formal 
logic) 114, 116, 151, 161, (recursive¬ 
ness) 228, 329, 337; — classes 9, 
401; — relation 401, 424; — theorem 

11 . 

erasure (Turing machine) 358. 
essential undecidability 437. 

Eubulides' paradox 39, cf. Epimenides. 
Euclid 27, 53, 60, 136, 191; —'s theorem 
on primes 191, 230, 286; cf. geometry, 
excluded middle, law of the (informal) 
47, 52, 57, 175, 196, 281, 296, 318, 
333, 394, (formal) 119, 120, 134, 190, 
191, 192, 483, 487, 511, 513. 
exclusive: disjunction 138; predicates, 
relations 11, 229. 

exhaustive: predicates, relations 11. 
existence 49, 70, 225, 501, 509; unique 
199, 225, 408; cf. introduction, pred¬ 
icate calculus, quantifiers, 
existential: quantifier 73; cf. actual, 
formal axiomatics. 
explicit: definition 220, 406; occurrence 

etc. 156, 160. 

exponents 222, 230. 
exportation 113. 

extension (function) 324, 338, (system) 
89, 94, 130, 310, 405, 428, 436, 514. 
external inconsistency 212. 
extremal clause 20, 259. 

falsity 125, 127, 500, cf. truth. 

Fermat's “fast theorem" 47, 50, 138, 

435. 

Feys, R. 141. 
fields 439, 

finitary methods 63, 479, 498. 


finite : axiomatizability 436, cf. 427; 
cardinal 12, 44; domain 168, 178, 400, 
435, 464; extension 437; sequence 5, 
70; set 12, 14; cf. finitary. 
Finsler, P. 212. 
f-less transform 411, 417, 419. 
formal axiomatics 28, 41, 53, 60, 421. 
formal calculation 194, 241, 262, 295, 

322. 

formal deduction 87. 

formal expression 70. 

formal implication 138. 

formal induction 181, cf. induction. 

formal inference 83. 

formal mathematics 53, 62. 

formal objects 62, 64, 70, 85, 249, 251, 

265, 

formal proof 65, 83, 85, 249, 254, 299. 
formal symbols 69, 249, 265. 
formal system (concept) 62, 69, 246, 265, 
299, 306, 323, 422, 434, 436, (examples) 
cf. eliminability, Gentzen, number 
theory, predicate calculus, proposi¬ 
tional calculus; — for a number- 
theoretic predicate (Thesis II and 
converse) 301, 304, 323, 404; cf. 

formalism, formalization, 
formal theorem 83, 85. 
formal theory 62, 65. 
formalism 43, 46, 59, 204, 479; a — cf. 
formal system; number-theoretic, cf. 
number theory; -— of recursive 
functions 263, 276. 
formalization 53, 59, 69, 210, 245, 298, 
396, 422; cf. formal system, for¬ 
malism. 

formation rules 72. 
formula 61, 72, 137, 248, 252, 441. 
F-quantifier 412. 
fraction 4, 56, cf. decimal, dual. 
Fraenkel, A. 16, 40, 45, 426, 517. 
free: 76, 253; for 79, 253; substitution 
80, 156; at substitution positions 79; 
term 79, 253, 410; variable 76, 148, 

161. 

Frege, G. 9, 43, 44, 61, 88, 250. 
f-term 410. 

function: 32, (notation) 33, 34; as a — 
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354; letter 263, 266, 276, 277; symbol 
70, 263, 403, 407, 417, 464. 
functional 234, 275, 326, 362. 
fundamental: inductive definition 258; 

theorem of arithmetic 230. 

Galileo’s “paradox” 3, 14, 46. 
gap (Turing machine) 364. 

Gauss, C. F. 26, 30, 48, 52, 56, 230. 
general properties of \~ 89, 104, 444. 
general recursive: class 307; function 

274, cf. recursive function; functional 

275, interpretation 464, predicate 276, 
cf. recursive predicate; scheme 275; 

set 307; truth 465, 500, 516. 
generality 49, 69, 225, 502; — inter¬ 
pretation 149; — quantifier 73; cf. 
introduction, predicate calculus, quan¬ 
tifiers. 

generalized arithmetic 246, 259, 276, 

290. 

genetic method 26, 53. 

Gentzen, G. 37, 69, 89, 100, 141, 225, 
440, 445, 453, 460, 463, 476, 478, 479, 
480, 495; —s consistency proof for 
number theory 476, 498, 499; —'s 
Hauptsatz (normal form or elimi¬ 
nation theorem) 440, 450, 453, 460, 
475, 476, 479, 492, 513, extended 
460, 463, 475; —type systems 441, 
460, 478, G1 442, G2 450, G3 480, 

G3a 481. 

geometry (analytic, Cartesian) 17, 54, 
(Euclidean, non-Euclidean) 17, 27, 40, 
41, 54, 55, 430, 514, (foundations) 28, 
60, 475, (projective) 55, 439. 
given function letter 266. 

Glivenko, V. 492. 

Godel, K. 16, 41, 46, 141, 204, 211, 212, 
213, 221, 225, 227, 239, 240, 241, 246, 
274, 317, 320, 321, 326, 389, 393, 397, 
398, 400, 425, 437, 486, 493, 495, 497, 
514; —’s (3-function 240, 243; —’s 
completeness theorem 389, 393, 394, 
397, 400, 422, 423, 427, 430, 436; —’s 
(incompleteness, undecidability) the¬ 
orem 204, 207, 211, 258, 274, 287, 304, 
308, 426, 430, 431, 514, generalized 


form of 302, 308, 430, 431, Rosser's 
form of 208, 308, symmetric form of 
(and W v W t ) 308, 309, 316, 332, 470, 
516; — numbering 206, 246, 254, 276, 
281, 288, 290, 296, 300, 313, 322, 381, 
386, 394, 398, 431, 434, 501, 502, 510, 
cf. recursive functions, Turing, —'s 
reduction of classical to intuitionistic 
systems 211, 493, 495, 497, 514; —'s 
second theorem (on consistency 
proofs) 210, 305, 476, 478, 479, 498, 

501. 

Gonseth’s paradox 38. 
group 29, 439; semi- 382, 386. 

Hall, M. 386. 

Hasenjaeger, G. 398. 

Hausdorff, F. 16. 
height 107. 

Henkin, L. 389, 432, 492. 

Herbrand, J. 98, 154, 179, 274, 326, 440, 

460, 463. 

Hermes, H. 246. 

Heyting, A. 51, 52, 57, 140, 166, 487, 
491, 492, 497, 509, 517. 
Hilbert, D. 26, 28, 43, 53, 55, 57, 58, 
61, 63, 136, 271, 318, 415, 424, 478; 
— Ackermann cf. Ackermann; — Ber- 
nays cf. Bernays; — type system 441. 
hypothesis of the induction 22. 

ideal: elements 55; statements (Hilbert) 
55, 213, 475, 513. 

idempotent laws 118. 
identical: act, schema etc. 267, 362, 380, 
461; equation, truth etc. 127, 149, 

172. 

identity, cf. equality; — function 220; 

principal of 113. 
imaginary (complex) numbers 56, 475. 
immediate: consequence 83, 254, 277; 

dependent 220. 
implication (interpretation) 51, 69, 138, 
141, 225, 498, 502, (formal logic) 69, 
113, 118, 124, 154, 167, cf. intro¬ 
duction, propositional connectives, 
truth tables. 


importation 113. 
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impredicable (paradox) 38. 
impredicative definition 42, 44. 
improper subset 10. 
inclusive disjunction 138. 
incompletely defined function 326. 
incompleteness (results) 207, 208, 213, 
274, 302, 309, 416, 427, 430, 511, of. 

Godel's theorem, 
inconsistency, of. consistency, 
indefinite description 347. 
independent variable 32. 
index in an enumeration 4. 
indirect proofs 49, 52, 497. 
individual 29, 44, 145, 180; — symbols 
70, 162, 403, 407, 419, 421, 464; 

— variables 145. 
induction 21, 44, 46, 93, 181, 193, 247, 
259, 274, 427, 429, 432, 476, 498; 

— axiom 20, 429, 432; — cases 186; 
definition by 217, 287, cf. recursion; 

— number 22, 478; — postulate (or 
schema) 181, restricted 204, 472, 
generalized 476; — predicate (or 
proposition) 22, 430, 479; — rule 

181; — step 22; — variable 22. 
inductive definition 20, 258, 306. 
inequalities 118, 187, 197, 225, 229. 
inference 83; rules of 60, 83. 
infinite: cardinal 13; descent 193, 478; 

sequence 7, 8, 17; set 13, 14. 
infinity, axiom of 46, 425; points at 
56; problem of 46; cf. actual, po¬ 
tential. 

informal: 41, 62, 64, 69, 85, 181; 

axiomatics 28, 41, 53; induction 181; 
mathematics 62, 69; presentation 181; 

symbolism 225; theory 65. 
initial : function 219; state etc. 357; 367. 
input (Turing machine) 367. 
integer 3, 4, 56, 439, 530, 532. 
interchange 121, (Gentzen) 443, 444; — 
of premises 113, 
interdeducible formulas 151, 154, 398, 

435. 

interpretation 57, 64, 69, 125, 130, 138, 
143, 175, 194, 212, 303, 421, 430, 478, 
499, 501; — of variables 146, 149, 227. 
intersection of sets 10, 16. 


introduction and elimination of logical 
symbols 98, 102, 106, 147, 148, (Gent¬ 
zen) 442, 451, 480; strong 105; weak 
negation elimination 101; cf. varia¬ 
tion. 

intuitionism 43, 46, 53, 56, 59, 63, 318, 
497, 513, cf. intuitionistic. 
intuitionistie: informal logic, mathema¬ 
tics, meanings 48, 281, 291, 318, 333, 
336, 497, 501, 516; formal logic, formal 
systems 51, 101, 114, 119, 120, 140, 
163, 165, 190, 192, 194, 414, 442, 444, 
479, 486, 487, 492, 504; — set theory 
52, 491, 515. 

intuitive, cf, informal, intuitionism. 
inverse laws 186. 

Iongh, J. J. de 492, 497, 498. 
irrational numbers 17, 31, 56. 
irredundant proof 482. 
irreflexiveness 188, 
irrefutable formula 389, 423. 
isomorphic systems 25. 

Jankowski, S. 98, 141. 
juxtaposition 71. 

Kalmdr, L. 135, 285, 286, 287, 518. 
Kemeny, J. G. 432. 

A-equality 172, 400. 

A-fold recursion 273. 

A-identity 172, 178, 400. 

Kleene, S. C. 88, 135, 261, 274, 275, 
279, 281, 282, 284, 286, 287, 288, 293, 
302, 304, 306, 307, 312, 320, 322, 324, 
325, 326, 335, 337, 350, 353, 355, 382, 
460, 487, 492, 503, 508, 511, 514, 515, 

516. 

Klein, F. 54. 

^-predicate calculus 177. 

A-proposition letter formula 177. 

A-recursive function 273. 

Kreisel, G, 431, 

Kronecker, L. 19, 46, 320 (SJ), 498. 
A-transform 177, 178. 

Kuznecov, A. V. 289. 

lambda-definability 320, 382. 

Langford, C. H. 45, 141, 335. 
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language 61, 69, 249, 265. 
lattices 439, 492. 

least number: operator (e) 317, 324, 
414, (p) 225, 228, 279, 289, 295, 323, 
327, 329, 330, 347, 350, 414; principle 
189, 200, 511. 

least upper bound 31, 43. 

Leibnitz, G. W. v. 43, 61. 

letter: formula in =, P lt . .. P s 399; 

ra-tuple 132; in a semi-group 382. 
Lewis, C. I. 138, 141, 335. 
liar, the (paradox) 39, cf. Epimenides. 
limit 32, 348. 
linear order 29. 

Liouville, J. 7. 

Lobatchevsky, N. I. 28, 54. 
logic 46, 51, 60, 69, 80, cf. formal 
system, formalization, predicate cal¬ 
culus, set-theoretic, 
logical: constant 436; function 169, 389; 
notion 28, 60; operator 73; rules 442; 

symbol 69. 

logicism 43, 59, 180. 
logistic method 61, cf. logicism. 
Lowenheim, L. 136, 394, 427, 436; —'s 
(or the —Skolem) theorem 394, 398, 
400, 425, 426, 427, 428, 436. 
Lukasiewicz, J. 140, 335. 
lying Cretan, the 39, cf. Epimenides. 
Lyndon, R. C. 532. 

machine cf. Turing. 

MacLane, S. 154. 
major premise 264. 

Mannoury, G. 38, 498. 

Markov, A. A. 289, 382, 386. 
material: axiomatics 28, 41, 53; im¬ 
plication 138. 
mathematical: induction cf. induction; 

logic cf. formal system. 
McKinsey, J. C. C. 141, 522. 
meaning (classical mathematics) 57, 478, 
(functional notation) 33, 227 (logical 
symbolism) 225, cf. interpretation, 
member of a set 9. 

memory (Turing machine) 356, 366, 379. 

Meray, C. 30. 
meta- 63. 


metamathematical: letters, symbols, 
variables 62, 70, 140, 250, 265; def¬ 
initions, functions, predicates, re¬ 
cursion 72, 80, 251, 258, 276, 290; 
induction, proofs, theorems 85, 181. 
metamathematics 55, 59, 62, 63, 69, 80, 
85, 175, 176, 423; arithmetization of 
246, 276, 290, 396; cf. formal system, 

formalism. 

metatheory 62, 65. 

midsequent 460. 

minor premise 264. 

mix 450, 453; — formula 450. 

modal logic 141. 

model 25, 53, 422, 463, 475. 

modus ponens 98. 

Mostowski, A. 284, 287, 293, 297, 304, 
321, 430, 432, 437, 439, 492, 534. 
motion (Turing) 357, 358, 379, 380. 
multiplication cf. product, 
mutually exclusive relations 11. 

name 71, 250, 265. 

name form: 142, 146; interpretation 
146, 227; occurrence 157; prime — 
161; — replacement 161; variables 
142, 143, 146. 

n -ary relation 144. 

natural number 3, 12, 19, 44, 48, 51, 
217, 427, 432, 530, cf. number theo¬ 
ry* 

negation (interpretation) 51, 69, 138, 
225, 498, 502, (formal logic) 69, 101, 
113, 118, 121, 166, 407, cf. introduc¬ 
tion, intuitionistic, propositional con¬ 
nectives, truth tables. 
Nelson, D. 211, 223, 409, 487, 492, 504, 

511, 514, 515. 

nested recursion 271. 

Neumann, J. von 9, 40, 53, 57, 140, 204, 
425, 437, 440, 463, 472. 
non-arithmetical predicates 287, 292, 

315, 479, 501. 
non-constructive : logics 321, 431; proofs 

49, 52. 

non-enumerable sets 6, 16, 175, 423, 425. 
non-Euchdean geometry cf. geometry, 
non-intuitionistic methods 48, 51. 
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non-recursive : functions 324; predicates 
283, 310, 316, 332, 382. 
normal form, conjunctive etc. 134, 136; 

— for proofs (Gentzen) 440, 453, 460; 

— for recursive functions 288, 292, 

330; Skolem 435; cf. prenex. 
normal systems (Post) 320. 
notions 28. 

«-tuple 5, 17, 33. 

number 3, 6, 9 ; 12, 14, 16, 19, 29, 56, 

476. 

number theory 29, 51; formal 69, 82, 97, 
98, 181, 241, 252, 258, 295, 305, 310, 
314, 395, 406, 407, 414, 419, 427, 
492, 495, 501; — with restricted in¬ 
duction schema 204, 472, 476; without 
• 204, 407, 474; cf. analytic, Robin¬ 
son’s system. 

number variable 70, 263. 
number-theoretic : formula 108; function 
7, 17, 33, 38, 198, 217, 414, 509; 
predicate 226, 389; cf. number theory, 
numeral 177, 195, 252, 254, 255, 263, 
278; generalized 465. 
numeralwise: decidable formula 196, 
297; expressible predicate 195, 200, 
206, 244, 296, 298; representable 
function 200, 243, 295, 298. 
n-valued logic 140, 179, (for n = 3) 

332, 335, 515. 

object 24, 144; function as an 354; 

— language 63; system of —'s 24; 

— theory 62, 65; — variable 144; cf. 

formal objects. 

observation (Turing) 356, 378, 381. 
occurrence 70, 155, 441. 
omega-completeness 212. 
omega-consistency 207, 212, 304. 
one-place predicate calculus 398, 436. 
one-to-one correspondence 3, 9, 426. 
open formula 151. 
operation 32, ^358. 
operator, formal 73; logical 73. 
order 29, (cardinals) 10, (natural num¬ 
bers) 13, 21, 187, (reals) 31; first, 
second, higher 180, 421; within a 

type 44. 


ordinals 16, 476; constructive 325. 
ordinary notions 28, 60. 
oscillation, function of large 289. 
output (Turing machine) 358, 367. 

pairing of parentheses 23, 73. 
paradoxes 36, 40, 42, 45, cf. Epimenides, 
Galileo, Richard, Skolem, Zeno, 
parameters 34, 150; — in recursion 218, 

235, 342, 344. 
parentheses 23, 70, 73, 250. 
partial function 325, 332, 341, 351. 
partial order 29, 106. 
partial predicate 327. 
partial recursive: function 326, cf. re¬ 
cursive function; functional 326; pred¬ 
icate 328; scheme 326. 

Pasch, M. 61. 

passive: state etc. 357, 367. 

Paul (the apostle) 39. 

Peano, G. 20, 43, 61; —'s axioms 20, 
26, 186, 429, 432, (generalized) 247, 
cf. induction. 

Peirce, C. S. 61, 534. 

Peter, R. 223, 231, 271, 272, 273, 395, 

476 

plain f-term 410. 

Poincare, H. 30, 42, 46. 
point set 9, 18. 

Poncelet, J. V. 55. 
positive integer 3, 530, 532. 

Post, E. 135, 140, 284, 289, 293, 301, 
306, 307, 315, 320, 321, 343, 346, 356, 
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predicate letter: 142, 179; formula 143, 
equality and 399, 400, in 156, k- 177, 
with numerals 390; propositional cal¬ 
culus 109; interpretation of —s 145, 
168,174,421. 
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286. 
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redundant: axioms, postulates 406. 
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syntax language 63. 

systems: of equations 264, 276, 277; 


of objects 24, 53, 421, 436, 463; cf. 
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